GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:         March 1, 2005, 08:40:57 ; Search time 199.724 Seconds
                                          (without alignments)
                                          1690.542 Million cell updates/sec

Title:          US-10-624-932-2_COPY_26_898
Perfect score:  4660
Sequence:       1 QQSATVANPVPGANPDLLPH......AVAGLGQPDAGLFTVSEAEC 873

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       2105692 seqs, 386760381 residues

Total number of hits satisfying chosen parameters:      2105692

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database :      A_Geneseq_16Dec04:*
                1:  geneseqp1980s:*
                2:  geneseqp1990s:*
                3:  geneseqp2000s:*
                4:  geneseqp2001s:*
                5:  geneseqp2002s:*
                6:  geneseqp2003as:*
                7:  geneseqp2003bs:*
                8:  geneseqp2004s:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
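
The legible rows of the summary listing are consistent with "Query Match" being the hit score expressed as a percentage of the perfect score (4660), rounded to one decimal. The report does not state this formula, so the sketch below is an assumption checked only against values printed in this report; the function name `query_match_percent` is hypothetical.

```python
# Assumed relation (not stated in the report): Query Match = 100 * score / perfect score.
PERFECT_SCORE = 4660  # "Perfect score" from the report header


def query_match_percent(score, perfect=PERFECT_SCORE):
    """Return the score as a percentage of the perfect score, to one decimal."""
    return round(100.0 * score / perfect, 1)


# Scores taken from legible rows of the summary listing.
for score in (4660, 4650, 4567.5, 4511, 2747.5):
    print(score, query_match_percent(score))
```

For example, a score of 4650 gives 99.8 and a score of 4567.5 gives 98.0, which agrees with the values printed for results 3 and 4.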



                                 SUMMARIES

                             %
Result                     Query
   No.        Score        Match       Length  DB  ID         Description
-----------------------------------------------------------------------------
     1         4660        100.0          898   5  AAU85403   Aau85403 Human pro
     2         4660        100.0          898   8  ADH71618   Adh71618 Human pro
     3         4650         99.8          898   5  AAU97899   Aau97899 Human net
     4       4567.5         98.0          899   5  AAU79939   Aau79939 Human UNC
     5       4567.5         98.0          899   7  ADG42569   Adg42569 Novel hum
     6       4567.5         98.0          899   8  ADH71610   Adh71610 Human pro
     7       4567.5         98.0          899   8  ADH71628   Adh71628 Human pro
     8       4566.5         98.0          899   8  ADH71636   Adh71636 Human pro
     9  [illegible]  [illegible]  [illegible]   8  ADH71642   Adh71642 Human pro
    10  [illegible]  [illegible]  [illegible]   8  ADH71640   Adh71640 Human pro
    11  [illegible]  [illegible]  [illegible]   8  ADH71632   Adh71632 Human pro
    12  [illegible]         97.9          899   8  ADH71638   Adh71638 Human pro
    13  [illegible]  [illegible]  [illegible]   8  ADH71634   Adh71634 Human pro
    14  [illegible]  [illegible]  [illegible]   8  ADH71648   Adh71648 Human pro
    15  [illegible]  [illegible]  [illegible]   8  ADH71630   Adh71630 Human pro
    16  [illegible]         97.9          899   8  ADH71644   Adh71644 Human pro
    17  [illegible]         97.8          899   8  ADH71646   Adh71646 Human pro
    18  [illegible]         97.7          899   8  ADH71650   Adh71650 Human pro
    19         4552         97.7          898   8  ADH71626   Adh71626 Human pro
    20  [illegible]         96.9  [illegible]   4  AAM79128   Aam79128 Human pro
    21         4511         96.8          898   2  AAW78898   Aaw78898 Rat UNC-5
    22         4511         96.8          898   5  AAU10543   Aau10543 Rat netri
    23         4511         96.8          898   5  AAU97900   Aau97900 Rat netri
    24         4493         96.4          898   7  ADG42580   Adg42580 Rat trans
    25         4282         91.9          833   8  ADH71622   Adh71622 Human pro
    26         4282         91.9          842   5  AAU74818   Aau74818 Human REP
    27         4282         91.9          842   8  ADL06574   Adl06574 Human turn
    28         4275         91.7          817   8  ADH71624   Adh71624 Human pro
    29         2839         60.9          544   7  ADG42581   Adg42581 Human tra
    30         2815         60.4          556   2  AAW78899   Aaw78899 Human UNC
    31       2759.5         59.2          931   7  ADG42582   Adg42582 Mouse tra
    32       2747.5         59.0          931   4  AAB50691   Aab50691 Human UNC
    33       2747.5         59.0          931   7  ADE63098   Ade63098 Human Pro
    34       2747.5         59.0          931   7  ABU64297   Abu64297 Human thr
    35       2747.5         59.0          931   8  ADR99258   Adr99258 Human unc
    36       2747.5         59.0          982   4  ABG11551   Abg11551 Novel hum
    37         2736         58.7          964   8  ADR99250   Adr99250 Human 1R0
    38       2733.5         58.7          931   7  ADG42584   Adg42584 Human tra
    39       2688.5         57.7          929   7  ADG42583   Adg42583 Human tra
    40       2560.5         54.9          943   2  AAW78900   Aaw78900 Rat UNC-5
    41       2560.5         54.9          945   7  ADE63096   Ade63096 Rat Prote
    42       2546.5         54.6          933   5  AAO18734   Aao18734 Human NOV
    43       2546.5         54.6          933   5  AAO18735   Aao18735 Human NOV
    44       2546.5         54.6          945   5  ABB09520   Abb09520 Human tra
    45       2541.5         54.5          945   4  AAU12244   Aau12244 Human PRO



ALIGNMENTS 



RESULT 1 
AAU85403 

ID   AAU85403 standard; protein; 898 AA.
XX
AC   AAU85403;
XX
DT   21-MAY-2002  (first entry)
XX
DE   Human protein NOV1.
XX
KW   Human; NOVX; cardiomyopathy; atherosclerosis; diabetes;
KW   cell signal processing disorder; metabolic disorder; obesity; infection;
KW   anorexia; cancer-associated cachexia; cancer; neurodegenerative disorder;
KW   Alzheimer's disease; Parkinson's disease; immune disorder;
KW   haematopoietic disorders; dyslipidaemia; pain; asthma; hypertension;
KW   osteoporosis; Crohn's disease; multiple sclerosis; angina pectoris;
KW   myocardial infarction; ulcer; allergy; benign prostatic hypertrophy;
KW   psychosis; neurological disorder; anxiety; schizophrenia;
KW   manic depression; dementia; dyskinesia; Huntington's disease;
KW   Gilles de la Tourette's syndrome; gene therapy.
XX
OS   Homo sapiens.
XX
PN   WO200210216-A2.
XX
PD   07-FEB-2002.
XX
PF   30-JUL-2001; 2001WO-US024225.
XX
PR   28-JUL-2000; 2000US-0221409P.
PR   04-AUG-2000; 2000US-0222840P.
PR   04-AUG-2000; 2000US-0223752P.
PR   04-AUG-2000; 2000US-0223762P.
PR   04-AUG-2000; 2000US-0223769P.
PR   04-AUG-2000; 2000US-0223770P.
PR   14-AUG-2000; 2000US-0225146P.
PR   15-AUG-2000; 2000US-0225392P.
PR   15-AUG-2000; 2000US-0225470P.
PR   16-AUG-2000; 2000US-0225697P.
PR   01-FEB-2001; 2001US-0263662P.
PR   05-APR-2001; 2001US-0281645P.
XX
PA   (CURA-) CURAGEN CORP.
XX
PI   Padigaru M, Mezes P, Mishra V, Burgess C, Casman S, Grosse WM;
PI   Alsobrook JP, Lepley DM, Gerlach VL, Macdougall JR, Smithson G;
XX
DR   WPI; 2002-180074/23.
DR   N-PSDB; ABK37922.
XX
PT   New isolated cytoplasmic, nuclear, membrane bound, or secreted
PT   polypeptide, useful for treating cardiomyopathy, atherosclerosis,
PT   infections, cancer, neurodegenerative, metabolic, hematopoietic and
PT   immune disorders.
XX
PS   Claim 1; Page 11; 213pp; English.
XX
CC   The invention relates to an isolated cytoplasmic, nuclear, membrane
CC   bound, or secreted polypeptide (NOVX, x= 1-14) their variants or mature
CC   form. Also included are the nucleic acids encoding the NOVX proteins, a
CC   vector comprising the nucleic acid, a cell comprising the vector, an anti
CC   -NOVX antibody and modulators of NOVX. NOVX, the nucleic acid and the
CC   antibody are useful for treating or preventing a NOVX-associated
CC   disorder, where the disorder is selected from cardiomyopathy,
CC   atherosclerosis, diabetes, a disorder related to cell signal processing
CC   and metabolic pathway modulation, metabolic disorders, obesity,
CC   infectious disease, anorexia, cancer-associated cachexia, cancer,
CC   neurodegenerative disorders, Alzheimer's disease, Parkinson's disease,
CC   immune disorders, haematopoietic disorders, and the various
CC   dyslipidaemias, metabolic disturbances associated with obesity, the
CC   metabolic syndrome X and wasting disorders associated with chronic
CC   diseases, bacterial, fungal, protozoal and viral infections, pain,
CC   bulimia, asthma, hypertension, urinary retention, osteoporosis, Crohn's
CC   disease, multiple sclerosis, Albright Hereditary Osteodystrophy, angina
CC   pectoris, myocardial infarction, ulcer, allergy, benign prostatic
CC   hypertrophy, and psychotic and neurological disorders, including anxiety,
CC   schizophrenia, manic depression, delirium, dementia, and dyskinesias,
CC   such as Huntington's disease and Gilles de la Tourette's syndrome. The
CC   nucleic acid is useful in gene therapy. The present sequence represents a
CC   NOVX protein
XX
SQ   Sequence 898 AA;

  Query Match             100.0%;  Score 4660;  DB 5;  Length 898;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches  873;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Db    626 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 685

Qy    661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||


RESULT 2
ADH71618 



ID   ADH71618 standard; protein; 898 AA.
XX
AC   ADH71618;
XX
DT   25-MAR-2004  (first entry)
XX
DE   Human protein of the invention NOV21e SEQ ID NO: 514.
XX
KW   human; cytostatic; immunomodulator; neuroprotective; nootropic;
KW   anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy;
KW   vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease
KW   obesity; diabetes; infectious disease; metabolic syndrome X;
KW   dyslipidaemia.
XX
OS   Homo sapiens
XX
PN   WO2003102155-A2.
XX
PD
PF
     2002US-0385784P.
PR   05-JUN-2002; 2002US-0386041P.
PR   05-JUN-2002; 2002US-0386047P.
PR   06-JUN-2002; 2002US-0386376P.
PR   06-JUN-2002; 2002US-0386453P.
PR   06-JUN-2002; 2002US-0386864P.
PR   06-JUN-2002; 2002US-0387016P.
PR   07-JUN-2002; 2002US-0386796P.
PR   07-JUN-2002; 2002US-0386816P.
PR   07-JUN-2002; 2002US-0386931P.
PR   07-JUN-2002; 2002US-0386942P.
PR   07-JUN-2002; 2002US-0386971P.
PR   07-JUN-2002; 2002US-0387262P.
PR   08-JUN-2002; 2002US-[illegible].
PR   10-JUN-2002; 2002US-0387400P.
PR   10-JUN-2002; 2002US-[illegible].
PR   11-JUN-2002; 2002US-0387610P.
PR   11-JUN-2002; 2002US-0387625P.
PR   11-JUN-2002; 2002US-0387634P.
PR   11-JUN-2002; 2002US-0387668P.
PR   11-JUN-2002; 2002US-0387696P.
PR   11-JUN-2002; 2002US-0387702P.
PR   11-JUN-2002; 2002US-0387836P.
PR   11-JUN-2002; 2002US-0387859P.
PR   12-JUN-2002; 2002US-0387933P.
PR   12-JUN-2002; 2002US-0387934P.
PR   12-JUN-2002; 2002US-0387960P.
PR   12-JUN-2002; 2002US-0388022P.
PR   12-JUN-2002; 2002US-0388096P.
PR   13-JUN-2002; 2002US-0389123P.
PR   14-JUN-2002; 2002US-0389118P.
PR   14-JUN-2002; 2002US-0389120P.
PR   14-JUN-2002; 2002US-0389144P.
PR   14-JUN-2002; 2002US-0389146P.
PR   17-JUN-2002; 2002US-0389729P.
PR   17-JUN-2002; 2002US-0389742P.
PR   18-JUN-2002; 2002US-0389884P.
PR   19-JUN-2002; 2002US-0390006P.
PR   19-JUN-2002; 2002US-0390209P.
PR   21-JUN-2002; 2002US-0390763P.
PR   17-JUL-2002; 2002US-0396706P.
PR   06-AUG-2002; 2002US-0401628P.
PR   09-AUG-2002; 2002US-0402156P.
PR   09-AUG-2002; 2002US-0402256P.
PR   09-AUG-2002; 2002US-0402389P.
PR   12-AUG-2002; 2002US-0402786P.
PR   12-AUG-2002; 2002US-0402816P.
PR   12-AUG-2002; 2002US-0402821P.
PR   12-AUG-2002; 2002US-0402832P.
PR   13-AUG-2002; 2002US-0403448P.
PR   13-AUG-2002; 2002US-0403459P.
PR   13-AUG-2002; 2002US-0403531P.
PR   13-AUG-2002; 2002US-0403532P.
PR   13-AUG-2002; 2002US-0403563P.
PR   13-AUG-2002; 2002US-0406317P.
PR   15-AUG-2002; 2002US-0403617P.
PR   26-AUG-2002; 2002US-0406182P.
PR   26-AUG-2002; 2002US-0406355P.
PR   27-AUG-2002; 2002US-0406240P.
PR   12-SEP-2002; 2002US-0410084P.
PR   20-SEP-2002; 2002US-0412528P.
PR   23-SEP-2002; 2002US-0412731P.
PR   30-SEP-2002; 2002US-0414801P.
PR   30-SEP-2002; 2002US-0414839P.
PR   30-SEP-2002; 2002US-0414840P.
PR   30-SEP-2002; 2002US-0414954P.
PR   09-OCT-2002; 2002US-0417186P.
PR   09-OCT-2002; 2002US-0417406P.
PR   23-OCT-2002; 2002US-0420639P.
PR   28-OCT-2002; 2002US-0421156P.
PR   31-OCT-2002; 2002US-0422690P.
PR   01-NOV-2002; 2002US-0423130P.
PR   05-NOV-2002; 2002US-00423798.
PR   05-NOV-2002; 2002US-0423798P.
PR   12-NOV-2002; 2002US-0425453P.
XX 

PA   (CURA-) CURAGEN CORP.
XX
PI   Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ;
PI   Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K;
PI   Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X;
PI   Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR;
PI   Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS;
PI   Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L;
PI   Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA;
PI   Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M;
PI   Zhong H;
XX
DR   WPI; 2004-081935/08.
DR   N-PSDB; ADH71617.
XX
PT   New NOVX polypeptides and nucleic acid molecules useful for preventing or
PT   treating NOVX-associated disorders, e.g. cancer, diabetes, infection or
PT   obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX
PS   Example 21; SEQ ID NO 514; 1880pp; English.
XX
CC   The invention relates to a novel isolated polypeptide (NOVX). A
CC   polypeptide of the invention has cytostatic, immunomodulator,
CC   neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and
CC   antilipaemic activity, and may have a use in gene therapy, and as a
CC   vaccine. The polypeptides are encoded by NOVX polynucleotides comprising
CC   any of the 303 fully defined nucleotide sequences given in the
CC   specification. The polypeptide is useful in the manufacture of a
CC   medicament for treating a syndrome associated with a human disease. The
CC   polypeptide, polynucleotide and antibody are useful in diagnosing,
CC   treating or preventing NOVX-associated disorders, e.g. cancer, cachexia,
CC   Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious
CC   diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are
CC   further used as hybridisation probes, in chromosome mapping, tissue
CC   typing, preventive medicine, and pharmacogenomics. The present sequence
CC   represents a NOVX polypeptide of the invention.
XX
SQ   Sequence 898 AA;

  Query Match             100.0%;  Score 4660;  DB 8;  Length 898;
  Best Local Similarity   100.0%;  Pred. No. 0;
  Matches  873;  Conservative    0;  Mismatches    0;  Indels    0;  Gaps    0;

Qy      1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy     61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     86 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy    121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy    181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy    241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 325

Qy    301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    326 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 385

Qy    361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 445

Qy    421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    446 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 505

Qy    481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 565

Qy    541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 625

Qy    601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 660
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    626 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 685

Qy    661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 745

Qy    721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    746 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 805

Qy    781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    806 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy    841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
          |||||||||||||||||||||||||||||||||
Db    866 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898



RESULT 3 
AAU97899 

ID   AAU97899 standard; protein; 898 AA.
XX
AC   AAU97899;
XX
DT   27-AUG-2002  (first entry)
XX
DE   Human netrin binding membrane receptor UNC5H-1 protein.
XX
KW   Netrin binding membrane receptor; receptor; UNC5H-1; human; nootropic;
KW   neuroprotective; cytostatic; antiparkinsonian; cerebroprotective; cancer;
KW   central nervous system; CNS; stroke; Parkinson's disease;
KW   multiple sclerosis; Alzheimer's disease.
XX
OS   Homo sapiens.
XX
FH   Key             Location/Qualifiers
FT   Domain          152..223
FT                   /note= "Immunoglobulin domain"
FT   Domain          247..294
FT                   /note= "Thrombospondine type 1 domain"
FT   Domain          302..348
FT                   /note= "Thrombospondine type 1 domain"
FT   Region          361..382
FT                   /note= "Transmembrane region"
FT   Domain          495..598
FT                   /note= "ZU5 domain"
FT   Domain          817..897
FT                   /note= "Death domain"
XX
PN   WO200233080-A2.
XX
PD   25-APR-2002.
XX
PF   15-OCT-2001; 2001WO-EP011891.
XX
PR   16-OCT-2000; 2000US-0240061P.
XX
PA   (FARB) BAYER AG.
XX
PI   Koehler RH;
XX
DR   WPI; 2002-463314/49.
DR   N-PSDB; ABK52891.
XX
PT   Novel human netrin binding membrane receptor polypeptide and
PT   polynucleotides for identifying modulating agents useful in treating
PT   diseases e.g. Parkinson's disease, multiple sclerosis, stroke,
PT   Alzheimer's disease.
XX
PS   Claim 1; Fig 2; 94pp; English.
XX
CC   This invention relates to the DNA and protein sequences of a novel
CC   purified human netrin binding membrane receptor, UNC5H-1. The DNA
CC   sequence of the invention is useful as a probe for detecting a nucleic
CC   acid encoding the UNC5H-1 protein in a biological sample. The sequences
CC   of the invention are useful to screen for agents which decrease the
CC   activity of the UNC5H-1 protein. The sequences are also useful for
CC   screening agents which regulate (modulate) the activity of the protein of
CC   the invention. A pharmaceutical composition containing the protein of the
CC   invention or a reagent that modulates the activity of the UNC5H-1 protein
CC   may be useful for treating a UNC5H-1 dysfunction related disease such as
CC   cancer or a central nervous system (CNS) disorders (e.g. Parkinson's
CC   disease, multiple sclerosis, stroke and Alzheimer's disease). Fusion
CC   proteins comprising the UNC5H-1 protein are useful for generating
CC   antibodies and for in various assay systems, and the protein can be used
CC   as a bait protein in a two-hybrid assay or three-hybrid assay. The method
CC   of the invention is useful for detecting a coding sequence for the UNC5H-
CC   1 protein. The present sequence represents the human netrin binding
CC   membrane receptor UNC5H-1 protein of the invention
XX
SQ   Sequence 898 AA;

  Query Match             99.8%;  Score 4650;  DB 5;  Length 898;
  Best Local Similarity   99.8%;  Pred. No. 0;
  Matches  871;  Conservative    1;  Mismatches    1;  Indels    0;  Gaps    0;



Qy      1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy     61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     86 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy    121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
           |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    146 YLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy    181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy    241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 325

Qy    301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360
          ||||||||||||||||||||||||||:|||||||||||||||||||||||||||||||||
Db    326 APRNGGEECQGTDLDTRNCTSDLCVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 385

Qy    361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 445

Qy    421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    446 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 505

Qy    481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 565

Qy    541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 625

Qy    601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 660
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    626 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 685

Qy    661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 745

Qy    721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    746 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 805

Qy    781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    806 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy    841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
          |||||||||||||||||||||||||||||||||
Db    866 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898
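
The score of 4650 reported for RESULT 3 can be cross-checked against the perfect score. The sketch below assumes the two differences read from the alignment are R (query) against Y (database) at query position 121 and S against T at query position 327; these readings come from a damaged printout and are assumptions. The four BLOSUM62 entries are the standard published values, and the function name `rescored` is hypothetical.

```python
# Only the BLOSUM62 entries needed for this check (standard matrix values).
BLOSUM62 = {
    ("R", "R"): 5,
    ("R", "Y"): -2,
    ("S", "S"): 4,
    ("S", "T"): 1,
}

PERFECT_SCORE = 4660  # query aligned to itself, from the report header

# Assumed substitutions (query residue, database residue) read from RESULT 3.
SUBSTITUTIONS = [("R", "Y"), ("S", "T")]


def rescored(perfect, substitutions):
    """Replace each identity score by the substitution score; no gaps assumed."""
    score = perfect
    for query_res, db_res in substitutions:
        score += BLOSUM62[(query_res, db_res)] - BLOSUM62[(query_res, query_res)]
    return score


print(rescored(PERFECT_SCORE, SUBSTITUTIONS))
```

Under these assumptions the two substitutions cost 7 and 3 points respectively, which reproduces the reported score of 4650 and fits the stated counts of one mismatch and one conservative replacement.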



RESULT 4 


AAU79939 


ID   AAU79939 standard; protein; 899 AA.
XX
AC   AAU79939;
XX
DT   15-JUL-2002  (first entry)
XX
DE   Human UNC5-like protein NOV1.
XX
KW   Human; NOVX polypeptide; cardiomyopathy; atherosclerosis; cancer;
KW   cell signal processing; metabolic pathway modulation; cancerous tissue;
KW   antibody; diabetes; transgenic animal; UNC5-like protein; NOV1;
KW   chromosome 13.
XX
OS   Homo sapiens.
XX
PN   WO200229038-A2.
XX
PD   11-APR-2002.
XX
PF   04-OCT-2001; 2001WO-US031377.
XX
PR   04-OCT-2000; 2000US-0237862P.
XX
PA   (CURA-) CURAGEN CORP.
XX
PI   Herrmann JL, Rastelli L, Shimkets RA;
XX
DR   WPI; 2002-340104/37.
DR   N-PSDB; ABK49422.
XX
PT   Novel isolated NOVX polypeptide, and encoded polynucleotide, useful for
PT   treating cardiomyopathy, artherosclerosis, and cancer.
XX
PS   Claim 1; Page 9; 180pp; English.
XX
CC   The present invention relates to a new NOVX polypeptide having a 900
CC   (NOV1), 4349 (NOV2), 940 (NOV3), 798 (NOV4), 865 (NOV5), or 331 (NOV6)
CC   residue amino acid sequence, as given in the specification. The novel
CC   polypeptide, and its encoding polynucleotide, are used to treat
CC   cardiomyopathy, atherosclerosis, cancer or a disease related to cell
CC   signal processing and metabolic pathway modulation, in a human. Detecting
CC   the polypeptide or polynucleotide is useful for identifying cancerous
CC   tissue. The antibody can be used to treat diabetes or cancer. The host
CC   cells can be used to produce non-human transgenic animals useful in drug
CC   screening. The present amino acid sequence is that of the human UNC5-like
CC   protein NOV1 of the invention. This sequence is encoded by the human UNC5
CC   -like NOV1 gene located on chromosome 13
XX
SQ   Sequence 899 AA;

  Query Match             98.0%;  Score 4567.5;  DB 5;  Length 899;
  Best Local Similarity   98.6%;  Pred. No. 0;
  Matches  863;  Conservative    2;  Mismatches    7;  Indels    3;  Gaps    3;

Qy      1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy     61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     86 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy    121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy    181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy    241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299
          |||||||||||||| ||||||  :|  : |  ||||||||||||||||||||||||||||
Db    266 SRSCTNPAPLNGGATCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy    300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy    360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419
          ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db    386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy    420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy    480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy    540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598
          ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db    565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624

Qy    599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 658
          |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    625 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 684

Qy    659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy    719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804

Qy    779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy    839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
          |||||||||||||||||||||||||||||||||||
Db    865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899



RESULT 5
ADG42569

ID ADG42569 standard; protein; 899 AA.
XX

AC ADG42569;
XX

DT 26-FEB-2004 (first entry)
XX

DE Novel human NOV1.
XX

KW cytostatic; gene therapy; NOVX-agonist; NOVX-antagonist; pharmaceutical;
KW NOVX-associated disorder; cancer; human.
XX

OS Homo sapiens.
XX

PN US2003204052-A1.
XX

PD 30-OCT-2003.
XX

PF 04-OCT-2001; 2001US-00970944.
XX

PR 04-OCT-2000; 2000US-0237862P.
XX

PA (HERR/) HERRMANN J L.
PA (RAST/) RASTELLI L.
PA (SHIM/) SHIMKETS R A.
XX

PI Herrmann JL, Rastelli L, Shimkets RA;
XX

DR WPI; 2003-900673/82.
DR N-PSDB; ADG42568.
XX

PT New NOVX gene or NOVX-specific antibody, useful for preparing a
PT composition for treating or preventing a NOVX-associated disorder, e.g.,
PT cancer.
XX

PS Claim 1; SEQ ID NO 2; 118pp; English.
XX

CC The invention describes a new isolated polypeptide comprising: a 

CC polypeptide or its mature form comprising a sequence not given in the 

CC specification; or a variant of (A), where one or more amino acid residues

CC in the variant differs in no more than 15% from the amino acid sequence 

CC of the mature form. The pharmaceutical composition may be administered 

CC via oral, transdermal, rectal or parenteral route. The polypeptide, 

CC nucleic acid or antibody is useful for preparing a composition for 

CC treating or preventing a NOVX-associated disorder, e.g., cancer. This is 

CC the amino acid sequence of a human NOVX protein. 

XX 

SQ Sequence 899 AA; 



Query Match 98.0%; Score 4567.5; DB 7; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 863; Conservative 2; Mismatches 7; Indels 3; Gaps 3; 



Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299
        |||||||||||||||||||||  :|:   |  ||||||||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy  300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy  360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419
        ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db  386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy  420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy  480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy  540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db  565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624

Qy  599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 658
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 684

Qy  659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy  719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804

Qy  779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy  839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||||||||||||||||||||||||||||||
Db  865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899



RESULT 6 
ADH71610 

ID ADH71610 standard; protein; 899 AA. 
XX 

AC ADH71610; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21a SEQ ID NO: 506. 
XX 

KW human; cytostatic; immunomodulator; neuroprotective; nootropic;
KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy;
KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease;
KW obesity; diabetes; infectious disease; metabolic syndrome X;
KW dyslipidaemia.
XX

OS Homo sapiens.
XX

PN WO2003102155-A2.
XX

PD 11-DEC-2003.
XX

PF 03-JUN-2003; 2003WO-US017430.
XX 

PR 03-JUN-2002; 2002US-0385120P.
PR 04-JUN-2002; 2002US-0385784P.
PR 05-JUN-2002; 2002US-0386041P.
PR 05-JUN-2002; 2002US-0386047P.
PR 06-JUN-2002; 2002US-0386376P.
PR 06-JUN-2002; 2002US-0386453P.
PR 06-JUN-2002; 2002US-0386864P.
PR 06-JUN-2002; 2002US-0387016P.
PR [illegible]-JUN-2002; 2002US-0296960P.
PR 10-JUN-2002; 2002US-0387400P.
PR 10-JUN-2002; 2002US-0387535P.
PR 11-JUN-2002; 2002US-0387610P.
PR 11-JUN-2002; 2002US-0387625P.
PR 11-JUN-2002; 2002US-0387634P.
PR 11-JUN-2002; 2002US-0387668P.
PR 11-JUN-2002; 2002US-0387696P.
PR 11-JUN-2002; 2002US-0387702P.
PR 11-JUN-2002; 2002US-0387836P.
PR 11-JUN-2002; 2002US-0387859P.
PR 12-JUN-2002; 2002US-0387933P.
PR 12-JUN-2002; 2002US-0387934P.
PR 12-JUN-2002; 2002US-0387960P.
PR 12-JUN-2002; 2002US-0388022P.
PR 12-JUN-2002; 2002US-0388096P.
PR 13-JUN-2002; 2002US-0389123P.
PR 14-JUN-2002; 2002US-0389118P.
PR 14-JUN-2002; 2002US-0389120P.
PR 14-JUN-2002; 2002US-0389144P.
PR 14-JUN-2002; 2002US-0389146P.
PR 17-JUN-2002; 2002US-0389729P.
PR 17-JUN-2002; 2002US-0389742P.
PR 18-JUN-2002; 2002US-0389884P.
PR 19-JUN-2002; 2002US-0390006P.
PR 19-JUN-2002; 2002US-0390209P.
PR 21-JUN-2002; 2002US-0390763P.
PR 17-JUL-2002; 2002US-0396706P.
PR 06-AUG-2002; 2002US-0401628P.
PR 09-AUG-2002; 2002US-0402156P.
PR 09-AUG-2002; 2002US-0402256P.
PR 09-AUG-2002; 2002US-0402389P.
PR 12-AUG-2002; 2002US-0402786P.
PR 12-AUG-2002; 2002US-0402816P.
PR 12-AUG-2002; 2002US-0402821P.
PR 12-AUG-2002; 2002US-0402832P.
PR 13-AUG-2002; 2002US-0403448P.
PR 13-AUG-2002; 2002US-0403459P.
PR 13-AUG-2002; 2002US-0403531P.
PR 13-AUG-2002; 2002US-0403532P.
PR 13-AUG-2002; 2002US-0403563P.
PR 13-AUG-2002; 2002US-0406317P.
PR 15-AUG-2002; 2002US-0403617P.
PR 26-AUG-2002; 2002US-0406182P.
PR 26-AUG-2002; 2002US-0406355P.
PR 27-AUG-2002; 2002US-0406240P.
PR 12-SEP-2002; 2002US-0410084P.
PR 20-SEP-2002; 2002US-0412528P.
PR 23-SEP-2002; 2002US-0412731P.
PR 30-SEP-2002; 2002US-0414801P.
PR 30-SEP-2002; 2002US-0414839P.
PR 30-SEP-2002; 2002US-0414840P.
PR 30-SEP-2002; 2002US-04149[illegible]4P.
PR 09-OCT-2002; 2002US-04171[illegible]P.
PR 09-OCT-2002; 2002US-0417406P.
PR [illegible]-OCT-2002; 2002US-0420639P.
PR 28-OCT-2002; 2002US-0421156P.
PR [illegible]-OCT-2002; 2002US-0422690P.
PR 01-NOV-2002; 2002US-0423130P.
PR 05-NOV-2002; 2002US-00423798.
PR 05-NOV-2002; 2002US-0423798P.
PR 12-NOV-2002; 2002US-0425453P.
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ;

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K;

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X;

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR;

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS;

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L;

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA;

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M;

PI Zhong H;

XX 

DR WPI; 2004-081935/08. 
DR N-PSDB; ADH71609. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 
PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 
PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX 

PS Example 21; SEQ ID NO 506; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator,

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 



Query Match 98.0%; Score 4567.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 863; Conservative 2; Mismatches 7; Indels 3; Gaps 3; 

Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299
        |||||||||||||||||||||  :|:   |  ||||||||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy  300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy  360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419
        ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db  386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy  420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy  480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy  540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db  565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624

Qy  599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 658
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 684

Qy  659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy  719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804

Qy  779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy  839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||||||||||||||||||||||||||||||
Db  865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899



RESULT 7
ADH71628

ID ADH71628 standard; protein; 899 AA. 
XX 

AC ADH71628; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21j SEQ ID NO: 524.
XX

KW human; cytostatic; immunomodulator; neuroprotective; nootropic;

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia.

XX 

OS Homo sapiens. 
XX 

PN WO2003102155-A2.
XX

DR WPI; 2004-081935/08.

DR N-PSDB; ADH71627. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX 

PS Example 21; SEQ ID NO 524; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 



Query Match 98.0%; Score 4567.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 863; Conservative 2; Mismatches 7; Indels 3; Gaps 3;

Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299
        |||||||||||||||||||||  :|:   |  ||||||||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy  300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy  360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419
        ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db  386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy  420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy  480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy  540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db  565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624

Qy  599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 658
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 684

Qy  659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy  719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804

Qy  779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy  839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||||||||||||||||||||||||||||||
Db  865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899



RESULT 8
ADH71636

XX

DE Human protein of the invention NOV21n SEQ ID NO: 532. 
XX 

KW human; cytostatic; immunomodulator; neuroprotective; nootropic;

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO2003102155-A2. 
XX 

PD 11-DEC-2003.
PA (CURA-) CURAGEN CORP.
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L;

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA;

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M;

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71635. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX 

PS Example 21; SEQ ID NO 532; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue

CC typing, preventive medicine, and pharmacogenomics. The present sequence

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 

Query Match 98.0%; Score 4566.5; DB 8; Length 899; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 862; Conservative 3; Mismatches 7; Indels 3; Gaps 3; 

Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299
        |||||||||||||||||||||  :|:   |  ||||||||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy  300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy  360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419
        ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db  386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy  420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy  480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy  540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db  565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624

Qy  599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 658
        ||:|||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 CYIFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 684

Qy  659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy  719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804

Qy  779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy  839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||||||||||||||||||||||||||||||
Db  865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899



RESULT 9 
ADH71642 

ID ADH71642 standard; protein; 899 AA. 
XX 

AC ADH71642; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21q SEQ ID NO: 538. 
XX 

KW human; cytostatic; immunomodulator; neuroprotective; nootropic;

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO2003102155-A2. 
XX 
PA (CURA-) CURAGEN CORP.
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71641. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX 

PS Example 21; SEQ ID NO 538; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 

Query Match 98.0%; Score 4564.5; DB 8; Length 899; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 862; Conservative 3; Mismatches 7; Indels 3; Gaps 3; 

Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 26 QQSAWANPVPGANPDLLPHFLV^PEDVTIVl^NKPvXLVCKAVPATQIFFKCNGEWVRQV 85 

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145 

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

M I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 146 RLRKNFEQEPLAKEVSLEQGIVTLPCRPPEGIPPAEVEW 205 

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I 



Db 206 LWRQARLADTANYTCVAKNIVARRRSASAAVTVYVNGGWSTWTEWSVCSASCGRGWQKR 265



Qy 241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299 

I I I I I I I I I I I I I I I I I I I I I : I : I I I II I I I I I I I I I I I I I I I I I I I II I II 
Db 266 SRSCTNPAPLNGGAPCEGQ^^\/HDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325 

Qy 300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359 

I I I I I I I I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I II I I I I I I I 
Db 326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVTUjWGLIAVAVCLVLLLLVLILVYC 385 

Qy 360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I 

Db 386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444 

Qy 420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504 

Qy 480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564 

Qy 540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624

Qy 599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 658 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I : I I 
Db 625 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKELVQ 684 

Qy 659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744 

Qy 719 LHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 745 LHCTFTLERVSPSTSD1ACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804 

Qy 779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy 839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899 

RESULT 10 
ADH71640 

ID ADH71640 standard; protein; 899 AA. 
XX 

AC ADH71640; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21p SEQ ID NO: 536. 
XX 



KW human; cytostatic; immunomodulator; neuroprotective; nootropic;

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease;

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO2003102155-A2.
XX

PD 11-DEC-2003.
XX

PF 03-JUN-2003; 2003WO-US017430.
XX 

PR 03-JUN-2002; 2002US-0385120P.

PR 04-JUN-2002; 2002US-0385784P.

PR 05-JUN-2002; 2002US-0386041P.

PR 05-JUN-2002; 2002US-0386047P.

PR 06-JUN-2002; 2002US-0386376P.

PR 06-JUN-2002; 2002US-0386453P.

PR 06-JUN-2002; 2002US-0386864P.

PR 06-JUN-2002; 2002US-0387016P.

PR 07-JUN-2002; 2002US-0386796P.

PR 07-JUN-2002; 2002US-0386816P.

PR 07-JUN-2002; 2002US-0386931P.

PR 07-JUN-2002; 2002US-0386942P.

PR 07-JUN-2002; 2002US-0386971P.

PR 07-JUN-2002; 2002US-0387262P.

PR 08-JUN-2002; 2002US-0296960P.

PR 10-JUN-2002; 2002US-0387400P.

PR 10-JUN-2002; 2002US-0387535P.

PR 11-JUN-2002; 2002US-0387610P.

PR 11-JUN-2002; 2002US-0387625P.

PR 11-JUN-2002; 2002US-0387634P.

PR 11-JUN-2002; 2002US-0387668P.

PR 11-JUN-2002; 2002US-0387696P.

PR 11-JUN-2002; 2002US-0387702P.

PR 11-JUN-2002; 2002US-0387836P.

PR 11-JUN-2002; 2002US-0387859P.

PR 12-JUN-2002; 2002US-0387933P.

PR 12-JUN-2002; 2002US-0387934P.

PR 12-JUN-2002; 2002US-0387960P.

PR 12-JUN-2002; 2002US-0388022P.

PR 12-JUN-2002; 2002US-0388096P.

PR 13-JUN-2002; 2002US-0389123P.

PR 14-JUN-2002; 2002US-0389118P.

PR 14-JUN-2002; 2002US-0389120P.

PR 14-JUN-2002; 2002US-0389144P.

PR 14-JUN-2002; 2002US-0389146P.

PR 17-JUN-2002; 2002US-0389729P.

PR 17-JUN-2002; 2002US-0389742P.

PR 18-JUN-2002; 2002US-0389884P.

PR 19-JUN-2002; 2002US-0390006P.

PR 19-JUN-2002; 2002US-0390209P.

PR 21-JUN-2002; 2002US-0390763P.

PR 17-JUL-2002; 2002US-0396706P.

PR 06-AUG-2002; 2002US-0401628P.



XX

DR N-PSDB; ADH71639.
XX

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX

PS Example 21; SEQ ID NO 536; 1880pp; English.
XX

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator,



CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence

CC represents a NOVX polypeptide of the invention. 
XX 

SQ Sequence 899 AA; 



Query Match 98.0%; Score 4564.5; DB 8; Length 899; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 862; Conservative 3; Mismatches 7; Indels 3; Gaps 3; 



Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I
Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 86 DHVTERSTDGSSGEPTMEWINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I
Db 146 RLRKNFEQEPLAKEVSLEQGIVliPCRPPEGIPPAEV^WLRNEDLVDPSLDPNVYITREHS 205

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVTW 240

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVTW 265

Qy 241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299

I I I II I II I I I I I I I I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy 300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359

I II I I I I I I I II I I I I I II I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I
Db 326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy 360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I
Db 386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy 420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I
Db 445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy 480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539

II I II III I I I I I II III II I I I I II II I I I I I I I I I II I I II I I II II II II II II III
Db 505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy 540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 565 GVXLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVXHLGEEAPSHLYYCQLEASA 624

Qy 599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 658

I I I I I I I I I I I I I I II I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I
Db 625 CYVFTEQLGRFALVGEALSVAAAKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 684

Qy 659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy 719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804

Qy 779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838

I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II II I I II I
Db 805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy 839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

I I I I II I I II I I I II I I I I I I I I I I I I I I I I I I I I
Db 865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899





RESULT 11
ADH71632

ID ADH71632 standard; protein; 899 AA.
XX

AC ADH71632;
XX

DT 25-MAR-2004 (first entry)
XX

DE Human protein of the invention NOV21l SEQ ID NO: 528.
XX

KW human; cytostatic; immunomodulator; neuroprotective; nootropic;

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy;

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease;

KW obesity; diabetes; infectious disease; metabolic syndrome X;

KW dyslipidaemia.

XX

OS Homo sapiens.
XX

PN WO2003102155-A2.
XX

PD 11-DEC-2003.
XX

PF 03-JUN-2003; 2003WO-US017430.
XX

PR 31-OCT-2002; 2002US-0422690P.

PR 01-NOV-2002; 2002US-0423130P.

PR 05-NOV-2002; 2002US-00423798.

PR 05-NOV-2002; 2002US-0423798P.

PR 12-NOV-2002; 2002US-0425453P.
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X;

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71631. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX 

PS Example 21; SEQ ID NO 528; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator,

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 

Query Match 97.9%; Score 4563.5; DB 8; Length 899; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 862; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 

Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | I | | I I I I I 
Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85



Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145 

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205 

Qy 181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYWGGWSTWTEWSVCSASCGRGWQKR 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 206 LVVKQARLADTANYTCVAKNIVARRRSASAAVIVYWGGWSTWTEWSVCSASCGRGWQKR 265

Qy 241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299 

I I I I I I II I I I I I I I I I I I I I . : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 266 SRSCTNPAPLNGGAFCEGQ1WHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325 

Qy 300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359 

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385 

Qy 360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444 

Qy 420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy 480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 505 FLGGRLMIPNTGISLLIPPDVIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564 

Qy 540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624 

Qy 599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 658 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 625 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 684 

Qy 659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy 719 LHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804 

Qy 779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864 

Qy 839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899



RESULT 12 
ADH71638 

ID ADH71638 standard; protein; 899 AA. 
XX 

AC ADH71638; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21o SEQ ID NO: 534.
XX

KW human; cytostatic; immunomodulator; neuroprotective; nootropic;

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia.

XX

OS Homo sapiens.
XX 

PN WO2003102155-A2. 
XX 
XX

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K;

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X;

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR;

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M;

PI Zhong H; 

XX 



DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71637. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX 

PS Example 21; SEQ ID NO 534; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 

Query Match 97.9%; Score 4563.5; DB 8; Length 899; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 862; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 

Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I
Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 206 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy 241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299

I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy 300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359

I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I
Db 326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy 360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy 420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I
Db 445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy 480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539

I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II
Db 505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy 540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I
Db 565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624

Qy 599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 658

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II
Db 625 CYVFTEQLGRFALVGEALSVAATKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 684

Qy 659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy 719 LHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 745 LHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804

Qy 779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy 839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I
Db 865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899
DR WPI; 2004-081935/08.

DR N-PSDB; ADH71633. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX 

PS Example 21; SEQ ID NO 530; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator,

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 



CC represents a NOVX polypeptide of the invention. 
XX 

SQ Sequence 899 AA; 



Query Match 97.9%; Score 4562.5; DB 8; Length 899; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 862; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 



Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

1 1 1 1 1 1 1 1 1 1 1 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 26 QQSATVANPVPGANPDLLPHFLV^PEDWIVia^KPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 146 RLRKNFEQEPLAKEVSLEQGIVXPCRPPEGIPPAEV^WLRNEDLVDPSLDPNVYITREHS 205

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

Qy 241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325


Qy 300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy 360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444


Qy 420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479

1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy 480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy 540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II

Qy 599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 658

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

Qy 659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718

III M II II II II 1 II II II II II 1 III III Mill 1 | 1 | | | | | | | | | | | || Mill | | 1

Qy 719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 745 LHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804



Qy 779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838

I I I I I I I I I I I I I I I Ml I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 

Db 805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy 839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899
PT New NOVX polypeptides and nucleic acid molecules useful for preventing or

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.
XX 

PS Example 21; SEQ ID NO 544; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX). A

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 



Query Match 97.9%; Score 4560.5; DB 8; Length 899; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 862; Conservative 2; Mismatches 8; Indels 3; Gaps 3;

Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I i 
Db 26 QQSAWANPVPGANPDLLPHFLV^PEDVYIvTOKPVTjLVCKAVPATQIFFKCNGEWVRQV 85 

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120 

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145 

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 146 RLRKNFEQEPL7^EVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205 

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 206 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265



Qy 241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299 

I I I I I I I I I I I I I I I I I I I I I : I : I I I I I I II I I I II I I I I I I I I I I I I I I I I 
Db 266 SRSCTNPAPLNGGAFCEGQ^^WDRWSSLLVSVDGSWSPWSKWSACGLDCT^^WRSRECSD 325 

Qy 300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359 

I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385 

Qy 360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419 

I I I I I I I I II I I I I I I I I I I I I I I I I II I I M I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444 

Qy 420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504 

Qy 480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I 
Db 505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564 

Qy 540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I 
Db 565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624 

Qy 599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 658 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 625 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 684 

Qy 659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744 

Qy 719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778 

I I II I I I I II I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 745 LHCTFTLERVSPSTSDLACKLVAAVQVEGGGQSFSINFNITKDTRFAELLALESE1AGVPAL 804 

Qy 779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864 

Qy 839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899
ADH71630

ID ADH71630 standard; protein; 899 AA. 
XX 

AC ADH71630; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21k SEQ ID NO: 526. 
XX 

KW human; cytostatic; immunomodulator; neuroprotective; nootropic;
KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy;
KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease;
KW obesity; diabetes; infectious disease; metabolic syndrome X;
KW dyslipidaemia.

OS Homo sapiens.
DR N-PSDB; ADH71629.

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or
PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or
PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics.

PS Example 21; SEQ ID NO 526; 1880pp; English.

CC The invention relates to a novel isolated polypeptide (NOVX). A
CC polypeptide of the invention has cytostatic, immunomodulator,
CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and






CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias. The nucleic acids are

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 



Query Match 97.9%; Score 4560.5; DB 8; Length 899; 

Best Local Similarity 98.5%; Pred. No. 0; 

Matches 862; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 



Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II
Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVTIVl^NKPVTiLVCKAVPATQIFFKCNGEWRQV 85

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 1 1 I 1
Db 86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 146 RLRKNFEQEPLAKEVSLEQGIVXPCRPPEGIPPAEV^WLRNEDLVT)PSLDPNVYITREHS 205

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 206 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy 241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy 300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359

II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVPILVYC 385

Qy 360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 '1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy 420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1
Db 445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy 480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy 540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 565 GVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624 

Qy 599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 658

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 625 CWFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 684 

Qy 659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718

I I I I t I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744 

Qy 719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804 

Qy 779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838

I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy 839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I
Db 865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899



Search completed: March 1, 2005, 08:56:45 
Job time : 202.724 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:         March 1, 2005, 08:42:47 ; Search time 52.3087 Seconds
                (without alignments)
                1245.848 Million cell updates/sec

Title:          US-10-624-932-2_COPY_26_898
Perfect score:  4660
Sequence:       1 QQSATVANPVPGANPDLLPH AVAGLGQPDAGLFTVSEAEC 873

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       513545 seqs, 74649064 residues

Total number of hits satisfying chosen parameters:      513545

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
Listing first 45 summaries 
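
The throughput figure in the run header can be cross-checked from the other header values. The sketch below assumes that a "cell update" is one query-residue by database-residue cell of the Smith-Waterman matrix, so that the cell count is the query length (873) times the residues searched; that definition is an assumption inferred from the numbers, not something the report states, and the function name is hypothetical.

```python
# Values copied from the run header of this search.
QUERY_LENGTH = 873            # query positions 1..873
DB_RESIDUES = 74_649_064      # "Searched: 513545 seqs, 74649064 residues"
SEARCH_SECONDS = 52.3087      # "Search time 52.3087 Seconds"


def million_cell_updates_per_sec(query_len, residues, seconds):
    """Matrix cells filled per second, in millions.

    Assumes one cell per (query residue, database residue) pair.
    """
    return query_len * residues / seconds / 1e6


rate = million_cell_updates_per_sec(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
print(f"{rate:.1f} Million cell updates/sec")
```

Under that assumption the computed rate agrees with the reported 1245.848 Million cell updates/sec to within rounding of the search time.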

Database : Issued_Patents_AA:*

1: /cgn2_6/ptodata/1/iaa/5A_COMB.pep:*
2: /cgn2_6/ptodata/1/iaa/5B_COMB.pep:*
3: /cgn2_6/ptodata/1/iaa/6A_COMB.pep:*
4: /cgn2_6/ptodata/1/iaa/6B_COMB.pep:*
5: /cgn2_6/ptodata/1/iaa/PCTUS_COMB.pep:*
6: /cgn2_6/ptodata/1/iaa/backfiles1.pep:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
ALIGNMENTS



RESULT 1 
US-08-808-982-5 

Sequence 5, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 



ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/08/808,982

FILING DATE:

CLASSIFICATION: 530
; ATTORNEY/AGENT INFORMATION:

NAME: OSMAN, RICHARD A

REGISTRATION NUMBER: 36,627

REFERENCE/DOCKET NUMBER: UC96-217
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 5: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 898 amino acids 

; TYPE: amino acid 

STRANDEDNESS: not relevant 
; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
US-08-808-982-5 

Query Match 96.8%; Score 4511; DB 2; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 838; Conservative 17; Mismatches 18; Indels 0; Gaps 0; 
Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I 

Db 26 QQSATVANPVPGANPDLLPHFLV^PEDVTIVl^KPVXLVCKAVPATQIFFKCNGEWVRQV 85 

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120 

I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I II I I I II I II I I I I I I I I I I I I I I I I I 
Db 86 DHVIERSTDSSSGLPTMEVl^INVSRQQV^KVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145 

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 146 YLRKNFEQEPLAKEVSLEQGIVXPCRPPEGIPPAEV^WLRNEDLVT)PSLDPNVYITREHS 205 

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II
Db 206 LWRQARLADTANYTCVAKNIVARRRSTSAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSSWSKWSACGLDCTHWRSRECSDP 325 

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360

111111111:1 I I I I I I I I I I I I : I : I I I I I I I I I : I I : I I I I I I I I I I I hill 
Db 326 APRNGGEECRGADLDTRNCTSDLCLHTASCPEDVALYIGLVAVAVCLFLLLLALGLIYCR 385 

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420 

I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCSRQDGPS 445 



Qy 421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480 

I I I I I : I I I I I I I I I I I M I I I I I I I I I I I : I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 446 PKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNF 505 

Qy 481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540 

I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 
Db 506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPWSCGPPG 565 

Qy 541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I III 
Db 566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGACY 625 

Qy 601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 660 

I I I I I I I I I I I I I I I I I I I I I I I : I I I II I I I I I I I I I I I I I I I II I I I II I I I I I II I 
Db 626 VFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 685 

Qy 661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720 

MM tlllllll MIIIIIMIIIIMMI II Mill llllli III II IIIIMM:IM 
Db 686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQQYLH 745 

Qy 721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780 

I I I I I I I : : I I I I I I I I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 746 CTFTLERINASTSDIACKVWVWQVEGDGQSFNINFNITKDTRFAELLALESEGGVPALVG 805 

Qy 781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840 

I I I I I I II I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 806 PSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865 

Qy 841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873 

I I I I I I I I I I I I I II I I I I I II II I II I I I I I 
Db 866 RHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 
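
The two percentages printed with each result appear to follow directly from the other figures on the same lines. The sketch below assumes that Query Match is the hit score divided by the perfect score (4660 in the run header) and that Best Local Similarity is the number of identical columns divided by all aligned columns (matches, conservative, mismatches and indels). Both formulas are inferred from the numbers in this report rather than taken from GenCore documentation, and the function names are hypothetical.

```python
PERFECT_SCORE = 4660  # "Perfect score" from the run header


def query_match(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the perfect (self-match) score."""
    return 100.0 * score / perfect_score


def best_local_similarity(matches, conservative, mismatches, indels):
    """Identical columns as a percentage of all aligned columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# RESULT 1: Score 4511; Matches 838; Conservative 17; Mismatches 18; Indels 0
print(round(query_match(4511), 1))                       # 96.8
print(round(best_local_similarity(838, 17, 18, 0), 1))   # 96.0
```

The same two functions reproduce the figures reported for the 557-residue hits (Score 2815.5; Matches 539; Conservative 2; Mismatches 15; Indels 1), which is some evidence that the inferred formulas match how the report was computed.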



RESULT 2 

US-09-306-902A-5 

; Sequence 5, Application US/09306902A 
; Patent No. 6277585 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
; NUMBER OF SEQUENCES: 9 

; CORRESPONDENCE ADDRESS:

; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 



SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA:

APPLICATION NUMBER: US/09/306,902A
FILING DATE: 07-May-1999
CLASSIFICATION: <Unknown>
; ATTORNEY/AGENT INFORMATION:

NAME: OSMAN, RICHARD A
; REGISTRATION NUMBER: 36,627

REFERENCE/DOCKET NUMBER: UC96-217
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 898 amino acids 
; TYPE: amino acid 

; STRANDEDNESS: not relevant 

; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-306-902A-5 



Query Match 96.8%; Score 4511; DB 3; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 838; Conservative 17; Mismatches 18; Indels 0; Gaps 0; 



Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1
Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1
Db 86 DHVIERSTDSSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1
Db 146 YLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii mini ii n ii 1 1 1 1 1 1 1 1 1 mi n n 1 1 1 1 1 1 1 1 1
Db 206 LVWQARLADTANYTCVAKNIVARRRSTSAAVIVYVTtfGGWS 265

Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSSWSKWSACGLDCTHWRSRECSDP 325

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360

111111111:1 1 II II 1 1 II II 1 : 1 : 1 1 1 1 1 1 1 1 1 : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 hill
Db 326 APRNGGEECRGADLDTRNCTSDLCLHTASCPEDVALYIGLVAVAVCLFLLLLALGLIYCR 385

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420

1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II II II 1 1 II II 1 1 II 1 1 1 1 1 1
Db 386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCSRQDGPS 445

Qy 421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480

1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 446 PKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNF 505

Qy 481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I t I I I I I I I : I I I I I I I
Db 506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPWSCGPPG 565

Qy 541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600

I I I I I I I I I I I I I I 1 I I I I I I I I 1 I I I I I I I I I I I I I I I I I II I : I I I I I I I I I I I III 
Db 566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGACY 625 

Qy 601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 660 

I I I I I I I I I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 626 VFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 685 

Qy 661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I : I I I 
Db 686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQQYLH 745 

Qy 721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780

lllllll:: I I II I I I I : I II I I I I I I I I I : I I I I II I I I I I I I I I I I I I I lllllll 
Db 746 CTFTLERINASTSDLACKVWVWQVEGDGQSFNINFNITKDTRFAELLALESEGGVPALVG 805 

Qy 781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840 

I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 806 PSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865 

Qy 841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 866 RHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 
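
Each result carries three statistics lines with a regular "label value;" layout (Query Match, Score, DB, Length, Best Local Similarity, Pred. No., Matches, Conservative, Mismatches, Indels, Gaps). A minimal sketch of how those lines could be read into a dictionary is shown below; the regular expression and the function name are illustrative assumptions, not part of GenCore, and the parser only handles the layout seen in this report.

```python
import re

# A label (letters, dots, spaces), whitespace, a number with an optional
# exponent and optional percent sign, then the terminating semicolon.
STAT_PATTERN = re.compile(r"([A-Za-z][A-Za-z. ]*?)\s+([0-9][0-9.eE+-]*%?);")


def parse_stats(lines):
    """Collect 'label value;' pairs from result statistics lines as floats."""
    stats = {}
    for line in lines:
        for label, value in STAT_PATTERN.findall(line):
            stats[label.strip()] = float(value.rstrip("%"))
    return stats


example = [
    "Query Match 96.8%; Score 4511; DB 3; Length 898;",
    "Best Local Similarity 96.0%; Pred. No. 0;",
    "Matches 838; Conservative 17; Mismatches 18; Indels 0; Gaps 0;",
]
stats = parse_stats(example)
print(stats["Score"], stats["Matches"], stats["Pred. No."])
```

Percent signs are dropped, so "Query Match" comes back as 96.8 rather than 0.968. Values written in exponent form, such as a Pred. No. of 3.5e-258, are accepted by the same pattern.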

RESULT 3 
US-08-808-982-6 

Sequence 6, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982 
FILING DATE: 



CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 

NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 
; REFERENCE/DOCKET NUMBER: UC96-217

TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 6: 
; SEQUENCE CHARACTERISTICS: 
; LENGTH: 557 amino acids 

; TYPE: amino acid 

STRANDEDNESS: not relevant 
; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
US-08-808-982-6 

Query Match 60.4%; Score 2815.5; DB 2; Length 557; 

Best Local Similarity 96.8%; Pred. No. 3.5e-258; 

Matches 539; Conservative 2; Mismatches 15; Indels 1; Gaps 1; 



Qy 318 NCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 377

I I II II I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 NCTSDLXVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 60 

Qy 378 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 437 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 120 

Qy 438 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 497 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 180 

Qy 498 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 557

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

Db 181 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 240

Qy 558 SPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 617

I II II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 SPDSWSLALKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACWFTEQLGRFALVGE^LLS 300 

Qy 618 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFK 677 

I I I I I I I I I I I II II I I I I I I I I II I I I II I I I I I I I II I I I I I I I I I II I I I I I I I I 

Db 301 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHLX 360 

Qy 678 DSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 737

II I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I

Db 361 DSYHNLXLSXHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 420

Qy 738 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 797 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I H I I 

Db 421 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 480 

Qy 798 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 857

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 540 



Qy 858 LGQPDAGLFT-VSEAEC 873 

I : I I I I I 
Db 541 TXPAGRWLLSQCSEAEC 557 



RESULT 4 

US-09-306-902A-6 

; Sequence 6, Application US/09306902A 

; Patent No. 6277585 

; GENERAL INFORMATION: 

; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: PatentIn Release #1.0, Version #1.30
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306,902A
; FILING DATE: 07-May-1999

; CLASSIFICATION: <Unknown> 

ATTORNEY/AGENT INFORMATION: 
; NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 

REFERENCE/DOCKET NUMBER: UC96-217
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 557 amino acids 

; TYPE: amino acid 

STRANDEDNESS: not relevant
; TOPOLOGY: not relevant 

; MOLECULE TYPE: peptide 

; SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

US-09-306-902A-6 

Query Match 60.4%; Score 2815.5; DB 3; Length 557; 

Best Local Similarity 96.8%; Pred. No. 3.5e-258; 

Matches 539; Conservative 2; Mismatches 15; Indels 1; Gaps 1; 



Qy 318 NCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 377

I I I I I I I I : I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I
Db 1 NCTSDLXVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 60



Qy 378 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 437 

I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I I I I I I I I I I I i I I I I 

Db 61 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 120 

Qy 438 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 497 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 180 

Qy 498 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 557

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 240

Qy 558 SPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 617 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 SPDSWSLALKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 300 

Qy 618 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFK 677 

I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHLX 360 

Qy 678 DSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 737 

I I I I I I II I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 DSYHNLXLSXHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 420 

Qy 738 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 797

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 480 

Qy 798 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 857 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 540

Qy 858 LGQPDAGLFT-VSEAEC 873 

I : I I I I I 

Db 541 TXPAGRWLLSQCSEAEC 557 



RESULT 5 
US-08-808-982-7 

Sequence 7, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 



COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION:
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: UC96-217
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 7: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 943 amino acids 

; TYPE: amino acid 

STRANDEDNESS: not relevant 
TOPOLOGY: not relevant 
MOLECULE TYPE: peptide 
US-08-808-982-7 

Query Match 54.9%; Score 2560.5; DB 2; Length 943; 

Best Local Similarity 54.0%; Pred. No. 1.4e-233; 

Matches 496; Conservative 139; Mismatches 215; Indels 69; Gaps 14; 



Qy 11 PGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDG 70

II : I I I I I : I I I I I I I I I I I I I I : I I I I I I : I I I I I I I I I I I : I I 
Db 38 PSAPAEQLPHFLLEPEDAYIVKNKPVELHCRAFPATQIYFKCNGEWVSQKGHVTQESLDE 97 

Qy 71 SSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

: : I I I I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I :: I I I I I I I I I I I : I I I 
Db 98 ATGLRIREVQIEVSRQQVEELFGLEDYWCQCVAWSSSGTTKSRRAYIRIAYLRKNFDQEP 157 

Qy 131 LAKEVSLEQGI VXPCRPPEGIPPAEVTSWLRNEDLVTDP 190 

I I I I I I : : : I I I I I I I : I I I I I I I : I I I : : I I : I I : I : I : I : : I I I I I : I 
Db 158 LAKEVPLDHEVLLQCRPPEGVPVAEVEWLKNEDVIDPAQDTNFLLTIDHNLIIRQARLSD 217 

Qy 191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250

I I I I I I I I I I I II : I I I : I 1111111111:1 Ml II I I I I I I I I : I : I I I I I I I 
Db 218 TANYTCVAKNIVAKRRSTTATVIVTWGGWSSWAEWSPCSNRCGRGWQKRTRTCTNPAPL 277 

Qy 251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

I I I I I I I I I I I I I I I : I I I I I : I : I I I I I I I : I I I I I I I I I I : I I I : I 

Db 278 NGGAFCEGQACQKTACTTVCPVDGAWTEWSKWSACSTECAHWRSRECMAPPPQNGGRDCS 337 

Qy 311 GTDLDTRNCTSDLCV HSASGPE DVALYVGL-IAVAVCLVLLLLVLILVY 358

II ll::IU III : : I: I I I I I I I : I I I I : I : I :: I I 

Db 338 GTLLDSKNCTDGLCVXNQRTLNDPKSRPLEPSGDVALYAGLWAVFVVLAVXMAVGVIW 397 

Qy 359 CRKKEGLDSDVADSS-ILTSGFQPVSIKPSKADNPHLL--TIQPDLSTTTTTYQGSLCPR 415

I I : I : I II 111111:1:: I I I I : III:: I : I : 

Db 398 RRNCRDFDTDITDSSAALTGGFHPVNFKTARPSNPQLLHPSAPPDLTASAGIYRGPVYAL 457 



Qy 416 QDGPSPKFQLTNGHLLSPL GGG 437

-11:1:111111 II
Db 458 QDS-ADKIPMTNSPLLDPLPSLKIKVYDSSTIGSGAGLADGADLLGVLPPGTYPGDFSRD 516



Qy 438 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 497

III I : I : I : III I : III I I I I I I I I I : I I I : I 

Db 517 THFLHLRS ASLGSQ-HLLGLPRDPSSSVSGTFGCLGGRLTIPGTGVSLLVP 566 

Qy 498 PDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTLLSPIVSCGPPGVLLTRPVILAMDHCGE 556

111:11 I :: I I ::| I III: I I I : I I I hill I : I I I I I : I : II I 
Db 567 NGAIPQGKFYDLYLRINKTEST-LPLSEGSQTVLSPSVTCGPTGLLLCRPVVLTVPHCAE 625

Qy 557 PSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEAL 616 

I : I I I : : I I I : I : I I I : II I I I I : I : : : I I I : I I : 
Db 626 VIAGDWIFQLKTQAHQGHWEEWTLDEETLNTPCYCQLEAKSCHILLDQLGTYVFTGESY 685 

Qy 617 SVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHF 676 

I : I I I I : I : II I I I I I I I : : I I I I I II I I I I I : : I I : III I :: I I : I I 
Db 686 SRSAVKRLQLAIFAPALCTSLEYSLRVYCLEDTPAALKEVLELERTLGGYLVEEPKTLLF 745 

Qy 677 KDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLA 736 

I I I I I I I I I I : I I : I : I : I I I I I I I I I I I I : I I I • I : I I I I I II I I I : : : : 

Db 746 KDSYHNLRLSLHDIPHAHWRSKLLAKYQEIPFYHVWNGSQKALHCTFTLERHSLASTEFT 805

Qy 737 CKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL--VGPSAFKIPFLIRQKI 794

II : I I I I I : I I I : : : : I I I I I I I : I I I I I I I I I II I 

Db 806 CKVCVRQVEGEGQIFQLHTTLA-ETPAGSLDALCSAPGNAATTQLGPYAFKIPLSIRQKI 864

Qy 795 ISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAA 854 

: I I I I II III I I I I I : I : I : : I I : I III : I I : I I I I I : I : I : I I : I 
Db 865 CNSLDAPNSRGNDWRLLAQKLSMDRYLNYFATKASPTGVILDLWEARQQDDGDLNSLASA 924

Qy 855 VAGLGQPDAGLFTVSEAEC 873

Db 925 LEEMGKSEMLVAMTTDGDC 943 



RESULT 6 

US-09-306-902A-7 

; Sequence 7, Application US/09306902A 
; Patent No. 6277585 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc

; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

; TITLE OF INVENTION: Netrin Receptors 

; NUMBER OF SEQUENCES: 9 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 

CITY: SAN FRANCISCO 

STATE: CALIFORNIA

COUNTRY: USA 

ZIP: 94104
COMPUTER READABLE FORM: 



MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: PatentIn Release #1.0, Version #1.30

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/306,902A

FILING DATE: 07-May-1999 
CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
; NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: UC96-217
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 943 amino acids 
; TYPE: amino acid 

; STRANDEDNESS: not relevant 

; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
US-09-306-902A-7 

Query Match 54.9%; Score 2560.5; DB 3; Length 943; 

Best Local Similarity 54.0%; Pred. No. 1.4e-233; 

Matches 496; Conservative 139; Mismatches 215; Indels 69; Gaps 14; 



Qy 11 PGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDG 70

II : I I I I I : I I I I I I I I I I I I I I : I I I I I I : I I I I I I I I I II : II 
Db 38 PSAPAEQLPHFLLEPEDAYIVKNKPVELHCRAFPATQIYFKCNGEWVSQKGHVTQESLDE 97 

Qy 71 SSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

: : I I I I : I I I I I I I I : : I I I I : I I I I I I I I I I I I I I I I : : I I I I I I I I I I I : I I I 
Db 98 ATGLRIREVQIEVSRQQVEELFGLEDYWCQCVAWSSSGTTKSRRAYIRIAYLRKNFDQEP 157 

Qy 131 LAKEVSLEQGIVXPCRPPEGIPPAEVEWLRNEDLVT^^ 190 

I I I I I I : : : I I I I I I I : I I I I I I I : I I I : : I I : I I :| : I : I : : I I I I I : I 

Db 158 LAKEVPLDHEVLLQCRPPEGVPVAEVEWLKNEDVIDPAQDTNFLLTIDHNLIIRQARLSD 217

Qy 191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250

II I I I I I I I I I I I : I I I : I I I I I I I I I I I : I III II I I I I I I I I : I : I I I I I I I 

Db 218 TANYTCVAKNIVAKRRSTTATVTWWGGWSSWAEWSPCSNRCGRGWQKRTRTCTNPAPL 277 

Qy 251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310

I I I I I I I I I I I II I I : I I I I I : I : I I I I I I I : I I I I I I I I I I : I I I : I 
Db 278 NGGAFCEGQACQKTACTTVCPVDGAWTEWSKWSACSTECAHWRSRECMAPPPQNGGRDCS 337

Qy 311 GTDLDTRNCTSDLCV HSASGPE DVALYVGL- IAVAVCLVLLLLVLILVY 358 

I I I I :: I I I III : : I : I I I I I I I : I I I I : I : I : : I I 

Db 338 GT LLD S KNCTD GLCVLNQRT LNDPKSRPLEP S GDVAL YAGLWAVFVVLAVXMAVG VT VY 397 

Qy 359 CRKKEGLDSDVADSS-ILTSGFQPVSIKPSKADNPHLL--TIQPDLSTTTTTYQGSLCPR 415

I I : I : I I I 111111:1:: MM : III: • I : I : 

Db 398 RRNCRDFDTDITDSSAALTGGFHPVNFKTARPSNPQLLHPSAPPDLTASAGIYRGPVYAL 457



Qy 416 QDGPSPKFQLTNGHLLSPL GGG 437

1 1 : 1 : 1 1 1 1 1 1 II
Db 458 QDS-ADKIPMTNSPLLDPLPSLKIKVYDSSTIGSGAGLADGADLLGVLPPGTYPGDFSRD 516

Qy 438 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 497

III 1 : 1 : 1 : III 1 : III 1 1 1 1 1 1 1 1 1 : 1 1 1 : 1
Db 517 THFLHLRS ASLGSQ-HLLGLPRDPSSSVSGTFGCLGGRLTIPGTGVSLLVP 566

Qy 498 PDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTLLSPIVSCGPPGVLLTRPVILAMDHCGE 556

1 1 1 : 1 1 1 : : 1 1 : : 1 1 III: 1 1 1 : 1 1 1 hill 1 : 1 1 111 = 1 ' II 1
Db 567 NGAIPQGKFYDLYLRINKTEST-LPLSEGSQTVLSPSVTCGPTGLLLCRPVVLTVPHCAE 625

Qy 557 PSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEAL 616

1 : 1 1 1 : : 1 1 1 : 1 : 1 1 1 : 1 1 1 1 1 1 : 1 : : : 1 1 1 : 1 1 :
Db 626 VIAGDWIFQLKTQAHQGHWEEWTLDEETLNTPCYCQLEAKSCHILLDQLGTYVFTGESY 685

Qy 617 SVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHF 676

1 : 1 1 1 1 : 1 : 1 1 1 1 1 1 1 1 1 : : 1 1 1 1 1 II 1 1 1 1 1 : : 1 1 : III 1 : : 1 1 : 1 1
Db 686 SRSAVKRLQLAIFAPALCTSLEYSLRVYCLEDTPAALKEVLELERTLGGYLVEEPKTLLF 745

Qy 677 KDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLA 736

1111111111:11:1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 : 1 : 1 1 II 1 1 1 1 1 1 ::: :
Db 746 KDSYHNLRLSLHDIPHAHWRSKLLAKYQEIPFYHVWNGSQKALHCTFTLERHSLASTEFT 805

Qy 737 CKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL--VGPSAFKIPFLIRQKI 794

1 1 : 1 1 II 1 : 1 1 1 : : : : 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1
Db 806 CKVCVRQVEGEGQIFQLHTTLA-ETPAGSLDALCSAPGNAATTQLGPYAFKIPLSIRQKI 864

Qy 795 ISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAA 854

: 1 1 1 1 II III Mill : 1 : 1 : : 1 1 : 1 III : 1 1 : 1 1 1 1 1 : 1 : 1 ■ : 1 1 : 1
Db 865 CNSLDAPNSRGNDWRLLAQKLSMDRYLNYFATKASPTGVILDLWEARQQDDGDLNSLASA 924

Qy 855 VAGLGQPDAGLFTVSEAEC 873

Db 925 LEEMGKSEMLVAMTTDGDC 943





RESULT 7 

US-09-949-016-10665 

; Sequence 10665, Application US/09949016 
; Patent No. 6812339 
; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al.

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 

THEREOF 

; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/949,016 

; CURRENT FILING DATE: 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS: 207012

; SOFTWARE: FastSEQ for Windows Version 4.0 



SEQ ID NO 10665 
LENGTH: 769 
TYPE: PRT 
ORGANISM: Human 
US-09-949-016-10665 

Query Match 50.2%; Score 2337.5; DB 4; Length 769; 

Best Local Similarity 56.1%; Pred. No. 1.5e-212; 

Matches 432; Conservative 136; Mismatches 179; Indels 23; Gaps 7; 

Qy 122 LRKNFEQEPLAKEVSLEQGIVTjPCRPPEGIPPAEVTS^ 181 

III I I I I I I I I I I I I I : : I I I I I I I I I I I I I I I : I I I :: I I I I III : I : I 
Db 3 LRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPVEDRNFYITIDHNL 62 

Qy 182 WRQARLADTANYTCVAKNIVARRRSASAAV^ 241 

::: I I I I : I I I I I I I I I I I I I I : I : I : I I I I I I I I I I I I I I I I I I I : : I I I I : I I I : 
Db 63 IIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRCGRGYQKRT 122 

Qy 242 RSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPA 301 

I : I I I I I I I I I I I I I I I I : I I I II I I I II I I I : I I I I I I II : I I I I I III: I I 
Db 123 RTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTPWSKWSTCGTECTHWRRRECTAPA 182

Qy 302 PRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVLLLLVLILVYCR 360 

I : I I I : : I I I : : I I I I I : : I : I I I I I I I : I I I I I I : : : I : I I : 

Db 183 PKNGGKDCDGLVl.QSKNCTDGLCMQTAPDSDDVALWGIVIAVIVCLAISVWALFVYRK 242 

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420 

: I I : I I I I I I I I I : I I : : I I l" : III:: I : I : I I 

Db 243 NHRDFESDI I DSSALNGGFQPVNI KAARQD LLAVP P DLT SAAAMYRGPVYALHD- VS 298 

Qy 421 PKFQLTNGHLLS PLGGGRHTLHHS S PTSEAEEFVSRLS TQNYF R 464 

I : I I : I I I : : : : : I I : I I I : I I I I : : 

Db 299 DKIPMTNSPILDPLPNLKIKWNTSGAVTPQDDLSEFTSKLSPQMTQSLLENEALSLKNQ 358 

Qy 465 SLPRGT--SNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLP 522

II I I I : I : I I III I : : I I : I : I I II I I I I : I : : I I : I : I : I : I : I I 

Db 359 SLARQTDPSCTAFGSFNSLGGHLIVPNSGVSLLIPAGAIPQGRVYEMYVTVHRKETMRPP 418 

Qy 523 LAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLG 582

: I II I I : I : I I I I I I I I I I I I 1 1 : I I I I ': I : : I : II I : : I MM: : I 
Db 419 MDDSQTLLTPWSCGPPGALLTRPWLTMHHCADPNTEDWKILLKNQAAQGQWEDVVWG 478 

Qy 583 EEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIR 642 

II : I I I : I I I : : I I I : I I I I : : I I I I I I I I : I I : I : I I I I : I I 
Db 479 EENFTTPCYIQLDAEACHILTENLSTYALVGHSTTKAAAKRLKLAIFGPLCCSSLEYSIR 538 

Qy 643 VYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVS 702 

I I I I II Mill:: I I : I : I I I I : : II : MM I II I I I II II : I I I I I I II 
Db 539 VYCLDDTQDALKEILHLERQMGGQLLEEPKALHFKGSTHNLRLSIHDIAHSLWKSKLLAK 598 

Qy 703 YQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTR 762 

I I M I I I I : I : T: I I II I II II I I I : I : I III I I I I I : I I I : I ::: : 
Db 599 YQEIPFYHVWSGSQRNLHCTFTLERFSLNTAELVCKLCVRQVEGEGQIFQLNCTVSEEPT 658

Qy 763 FAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLS 822 

: I I : : : I I I I I II I II I : I I I I I I I I II I I I I : I I : I : 
Db 659 GIDLPLLDPANTITTVTGPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLN 718 



Qy 823 FFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872

: I I : I I I ! : I I : I I I I : : I I : I I I I I I I : : I : : : : I : 
Db 719 YFATKSSPTGVTLDLWEAQNFPDGNLSMLAAVXEEMGRHETVVSLAAEGQ 768 



RESULT 8 

US-09-969-532-16 

; Sequence 16, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: Novel Human Membrane Proteins and Polynucleotides
Encoding the Same 

; FILE REFERENCE: LEX-0244-USA

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS: 33

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 16 

LENGTH: 886 
; TYPE: PRT 

; ORGANISM: homo sapiens 
US-09-969-532-16 

Query Match 40.4%; Score 1881; DB 4; Length 886; 

Best Local Similarity 41.9%; Pred. No. 4.3e-169; 

Matches 376; Conservative 155; Mismatches 261; Indels 106; Gaps 12; 

Qy 10 VPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTD 69

: I I I MM: 11:1 I I : I : I : I III II I I I I I I I I I I I I : I I I : I 
Db 46 IPSA-PGTLPHFIEEPDDAYIIKSNPIALRCKARPAMQIFFKCNGEWVHQNEHVSEETLD 104 

Qy 70 GSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQE 129 

MM II I I I : I I I I I I I : I I I M M II I I : I I : I I MM II I I II I : 
Db 105 ESSGLKVREVFINVTRQQVEDFHGPEDYWCQCVAWSHLGTSKSRKASVRIAYLRKNFEQD 164

Qy 130 PLAKEVSLEQGIVXPCRPPEGIPPAEV^WLRNEDLVI)PSLDPNWITREHSLWRQARLA 189 

I Ml : I I I I II I I I I M II I II I : I I : M I h : I : I :: I I I I I : 
Db 165 PQGREVPIEGMIVLHCRPPEGVPAAEVEWLKNEEPIDSEQDENIDTRADHNLIIRQARLS 224 

Qy 190 DTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAP 249 

I : I I II M I I M M I I I I I M I I : I I I M I I I I I 
Db 225 DSGNYTCMAANIVAKRRSLSATVVVYVDGSWEVWSEWSVCSP 266 

Qy 250 LNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEEC 309 

M I I III: I Mill: I 
Db 267 ECEHLRIRECTAPPPRNGGKFC 288 

Qy 310 QGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDV 369

: I : : I I I II: 1 = 111 I I II I : -| I I : II 

Db 289 EGLSQESENCTDGLCILGIENASDIALYSGL-GAAWAVAVLVIGVTLYRRSQSDYGVDV 347 



Qy 370 ADSSILTSGFQPVSIKPSKADNPHLL--TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTN 427

I I I I I I I I : I : I II : I I I I I : I I I : I III I : I
Db 348 IDSSALTGGFQTFNFKTVRQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKELMTE 404



Qy 428 GHLLSPLG GGRHTLHHSSPTSEAEEFV 454 

I : I I | | : | : : : : 

Db 405 SSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFSTMHPRNKM-PYI 463 

Qy 455 SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLH 514 

II III I III I I I I I : : I I I I : I I I I I III : | | I : : : : 

Db 464 QNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSIN 517 

Qy 515 KPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGS 574 

: I I I : I I I I I : I I I I : : : I I | : I I : I : I : : I I I : : : I 
Db 518 QGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQGK 576 

Qy 575 WEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVAC 634 

I I : I : : : I : I Mh I I : I : I : I I I I : : I I : I I : : I : : I 
Db 577 WEEVMSVEDESTS--CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFGCMSC 634

Qy 635 TSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSL 694 

I I : I I : I I I I : : I I : I II I : I I I I : : I I : : I I I I : : I : : I : I : I I 
Db 635 NSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDIPPFL 694 

Qy 695 WKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSIN 754 

I : I : I I : I I : I : : I I I I : I I I : I : I : I : I I : : I : : I I : 
Db 695 WRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQILQVQ 754 

Qy 755 FNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQK 814 

: I : I : : : II I "l I I I I I : I I I : I : : I I : I I I : I I I I 

Db 755 TSILESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNAKGKDWQMLAQK 814

Qy 815 LHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872 

: : : I I : I I : : I I : I : I I I I I I I I I : I : I III: : I : I : I I : : 
Db 815 NSINRNLSYFATQSSPSAVILNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISESQ 872



RESULT 9 

US-09-969-532-14 

; Sequence 14, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: Novel Human Membrane Proteins and Polynucleotides
Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS: 33

; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 14 

LENGTH: 897 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-14 



Query Match 40.2%; Score 1872.5; DB 4; Length 897; 

Best Local Similarity 41.5%; Pred. No. 2.8e-168; 

Matches 377; Conservative 155; Mismatches 260; Indels 117; Gaps 13; 



Qy 10 VPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTD 69

: I I I MM: I I : I I I : I : I : I III II I I I I I I I I I I I I : I I I : I 
Db 46 IPSA-PGTLPHFIEEPDDAYIIKSNPIALRCKARPAMQIFFKCNGEWVHQNEHVSEETLD 104 

Qy 70 GSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQE 129

I I I I II llhlllll I I : I I I I I II I I I I : I I : I I : I I I I I I I I I I : 
Db 105 ESSGLKVREVFINVTRQQVEDFHGPEDYWCQCVAWSHLGTSKSRKASVRIAYLRKNFEQD 164

Qy 130 PIAKEVSLEQGIVTiPCRPPEGIPPAEV^WLRNEDLvTDPSLDPNVTITREHSLVAmQAR^ 189 

I : I I : I I I I I I I I I I : I I I I I I I : I I :: I I I : : I : I : : I I I I I : 
Db 165 PQGREVPIEGMIVLHCRPPEGVPAAEVEWLKNEEPIDSEQDENIDTRADHNLIIRQARLS 224 

Qy 190 DTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAP 249

I : I I I I : I lllhlll II I : I I I : I I I : I I I I I I 
Db 225 DSGNYTCMAANIVAKRRSLSATVVVYVDGSWEVWSEWSVCSP 266

Qy 250 LNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEEC 309

: I I I III: I Mill: I 
Db 267 ECEHLRIRECTAPPPRNGGKFC 288 

Qy 310 QGTDLDTRNCTSDLCV HSASGPEDVALYVGLIAVAVCLVLLLLVLILVY 358

:| :: III II: I Mill II II I :|:: : :| 

Db 289 EGLSQESENCTDGLCILDKKPLHEIKPQSIENASDIALYSGL-GAAWAVAVLVIGVTLY 347 

Qy 359 CRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL--TIQPDLSTTTTTYQGSLCPRQ 416

I : II II I I I I I I : I : I II : I I I I I : I I I : I I 

Db 348 RRSQSDYGVDVIDSSALTGGFQTFNFKTVRQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQ 405

Qy 417 DGPSPKFQLTNGHLLSPLG GGRHTLHH 443 

I I I : I I : I I II: 

Db 406 D-PLDKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFST 464

Qy 444 SSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPR 503 

I : : : : II III I III I I I I M : I I I M I I I I I III 

Db 465 MHPRNKM-PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLI PHGAI PE 517 

Qy 504 GKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWS 563

: II I : : : : : I I I : I I I I I : I I I I : : : I I I : I I : I : I : 
Db 518 ENSWEIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWN 576

Qy 564 LRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKR 623

: II I: : : I I M I: : : M I MM I I : I : I : I I I I : : I I : 
Db 577 IHLKKRTQQGKWEEVMSVEDESTS--CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQ 634

Qy 624 LKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNL 683 

II : : I : : I I I : I M II I M : I I : II I M I I I I : : I I : : I I I I : : I 

Db 635 LKVAVFGCMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSL 694 

Qy 684 RLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQ 743

: : I: I: I I I : I : I M I I : I : : I I I M I I I : M M M I I : : I 
Db 695 QISVLDIPPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQ 754 



Qy 744 VEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCR 803 

::| I : :l : I : :: II I I I I I I I : I I I : I :: I I 

Db 755 LKGHEQILQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNA 814 

Qy 804 RGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDA 863 

: I II: I I I I : : : I I : I I : : I I : I : I I I I I I I I I : I : I II I : : t : 
Db 815 KGKDWQMLAQKNSINRNLSYFATQSSPSAVILNLWEARHQHDGDLDSLACALEEIGRTHT 874

Qy 864 GLFTVSEAE 872 

I : I I :: 
Db 875 KLSNISESQ 883 



RESULT 10 
US-09-969-532-12 

; Sequence 12, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: Novel Human Membrane Proteins and Polynucleotides

Encoding the Same 

; FILE REFERENCE: LEX-0244-USA

; CURRENT APPLICATION NUMBER: US/09/969,532

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS: 33

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 12 

LENGTH: 900 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-12 

Query Match 40.0%; Score 1864; DB 4; Length 900; 

Best Local Similarity 41.2%; Pred. No. 1.8e-167; 

Matches 376; Conservative 155; Mismatches 261; Indels 120; Gaps 13; 

Qy 10 VPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTD 69

: I I I MM: | | : | I I : I : | : I I I I II I II I I M I I I I I Ml I : I 
Db 46 IPSA-PGTLPHFIEEPDDAYIIKSNPIALRCKARPAMQIFFKCNGEWVHQNEHVSEETLD 104

Qy 70 GSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQE 129 

I II I II I I I M M M I I M I I M I I M I I : I I M I : I I I I M I M I : 
Db 105 ESSGLKVREVFINVTRQQVEDFHGPEDYWCQCVAWSHLGTSKSRKASVRIAYLRKNFEQD 164

Qy 130 PLAKEVSLEQGI VTjPCRPPEGIPPAEVEWLRNEDLVT)PSLD 189 

I Ml M I I I I I I I I I M II I I I I M I : M II: M M :: I II I I : 
Db 165 PQGREVPIEGMIVLHCRPPEGVPAAEVEWLKNEEPIDSEQDENIDTRADHNLIIRQARLS 224 

Qy 190 DTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAP 249 

I : MUM II II M I I II I M II M I I M I I II I 
Db 225 DSGNYTCMAANIVAKRRSLSATVVVYVDGSWEVWSEWSVCSP 266

Qy 250 LNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEEC 309 

: I I I III: I Mill: I 



Db 267 ECEHLRIRECTAPPPRNGGKFC 288



Qy 310 QGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDV 369 

: I :: I I I I I : I : I I I II II I : I :: : : | I : I I 

Db 289 EGLSQESENCTDGLCILGIENASDIALYSGL-GAAWAVAVLVIGVTLYRRSQSDYGVDV 347 

Qy 370 ADSSILTSGFQPVSIKPSKA DNPHLL--TIQPDLSTTTTTYQGSLC 413

I I I I I I I I : I : I II : I I I I I : I I I : I 

Db 348 IDSSALTGGFQTFNFKTVRQAKNIMELMIQEKSFGNSLLLNSAMQPDL-TVSRTYSGPIC 406 

Qy 414 PRQDGPSPKFQLTNGHLLSPLG GGRHT 440 

I I I I : I I : I I II: 
Db 407 -LQD-PLDKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHS 464 

Qy 441 LHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDA 500 

I :: :: II III I III I I I I I :: I I I I : I I I I I I 

Db 465 FSTMHPRNKM-PYIQNLS S LPTRTELRTTGVFGHLGGRLVMPNTGVS LLI PHGA 517 

Qy 501 IPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPD 560 

II : | | | : : : : : | I I : I I I I I : I I I I : : : I I I : I I : | : 
Db 518 IPEENSWEIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSE 576 

Qy 561 SWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAA 620 

I : : I I I : : : I I I : I : : : I : I III: I I : I : I : I I I I :: I 
Db 577 HWNIHLKKRTQQGKWEEVMSVEDESTS--CYCLLDPFACHVLLDSFGTYALTGEPITDCA 634

Qy 621 AKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSY 680 

I : I I : : I : : I I I : I I : I I I I : : I I : I I I I : I I I I : : I I : : I I I I : 
Db 635 VKQLKVAVFGCMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNT 694 

Qy 681 HNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLW 740

: I : : I : I : I I I : I : I I : I I : I : : I I I I : I I I : I : I : I : I I : 
Db 695 FSLQISVLDIPPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKIC 754 

Qy 741 VWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDP 800 

: I : : I I : : I : I : :: II I I I I I I I : I I I : I : : I 

Db 755 IRQLKGHEQILQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDT 814 

Qy 801 PCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQ 860

I : I i I : I I I I : : : I I : II : : I I : I : II I I I I I I I : I : I I I I : " : I :" 
Db 815 PNAKGKDWQMLAQKNSINRNLSYFATQSSPSAVILNLWEARHQHDGDLDSLACALEEIGR 874

Qy 861 PDAGLFTVSEAE 872 

I : I I :: 

Db 875 THTKLSNISESQ 886 

RESULT 11 
US-09-969-532-10 

Sequence 10, Application US/09969532 
Patent No. 6777232 
GENERAL INFORMATION: 
APPLICANT: Walke, D. Wade 
APPLICANT: Scoville, John 

TITLE OF INVENTION: Novel Human Membrane Proteins and Polynucleotides
Encoding the Same 
; FILE REFERENCE: LEX-0244-USA 



CURRENT APPLICATION NUMBER: US/09/969,532
CURRENT FILING DATE: 2001-10-02 
PRIOR APPLICATION NUMBER: US 60/237,280 
PRIOR FILING DATE: 2000-10-02 
NUMBER OF SEQ ID NOS: 33

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 10 
LENGTH: 911 
TYPE: PRT 

ORGANISM: homo sapiens
US-09-969-532-10 

Query Match 39.8%; Score 1855.5; DB 4; Length 911; 

Best Local Similarity 40.8%; Pred. No. 1.2e-166; 

Matches 377; Conservative 155; Mismatches 260; Indels 131; Gaps 14; 

Qy 10 VPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTD 69

: I I I MM: I I : I I I : I : I : I III II I I I I I I I I I I I I : I I I : I 
Db 46 IPSA-PGTLPHFIEEPDDAYIIKSNPIALRCKARPAMQIFFKCNGEWVHQNEHVSEETLD 104

Qy 70 GSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQE 129 

I I II 11111:11111 11:111111111 I I : I I : I I : I I I M I I I I I : 
Db 105 ESSGLKVREVFINVTRQQVEDFHGPEDYWCQCVAWSHLGTSKSRKASVRIAYLRKNFEQD 164 

Qy 130 PLAKEVSLEQGIVXPCRPPEGIPPAEV^WLRNEDLVTlPSLDPNVYITREHSLvAmQARlA 189 

I : I I : I I I I I II I I I : I I I I I I I : I I :: I I I : : | : I : : I I I I I : 
Db 165 PQGREVPIEGMIVLHCRPPEGVPAAEVEWLKNEEPIDSEQDENIDTRADHNLIIRQARLS 224

Qy 190 DTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAP 249

I : I I I I : I I I I I : I I I I I I : I I I : I I I : I I I I I I 
Db 225 DSGNYTCMAANIVAKRRSLSATVVVYVDGSWEVWSEWSVCSP 266

Qy 250 LNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEEC 309 

: I I I III: I Mill: I 
Db 267 ECEHLRIRECTAPPPRNGGKFC 288

Qy 310 QGTDLDTRNCTSDLCV HSASGPEDVALYVGLIAVAVCLVLLLLVLILVY 358

: I : : I II I I : I I : M I I I I I I : I : : : : I 

Db 289 EGLSQESENCTDGLCILDKKPLHEIKPQSIENASDIALYSGL-GAAWAVAVLVIGVTLY 347 

Qy 359 CRKKEGLDSDVADSSILTSGFQPVSIKPSKA DNPHLL--TIQPDLS 402

I : I I M I I I I I I : I : III Mill 

Db 348 RRSQSDYGVDVIDSSALTGGFQTFNFKTVRQAKNIMELMIQEKSFGNSLLLNSAMQPDL- 406 

Qy 403 TTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLG 435 

I : I I I : I I I I I : I I Ml 
Db 407 TVSRTYSGPIC-LQD-PLDKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNH 464 

Qy 436 GGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPN 489

II: I : : : : I I I I I I I I I I II M : M I 

Db 465 SRTFPHGNNHSFSTMHPRNKM-PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPN 517 

Qy 490 TGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVIL 549

II: III II III :|||::::: | | I : Mil I : I I I I : : : I I I 
Db 518 TGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFAL 576 



Qy 550 AMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRF 609

: I I : I : I : : I I I : : : I I I : I : : : I : I III: I I : I : I :
Db 577 TIPHCADVSSEHWNIHLKKRTQQGKWEEVMSVEDESTS--CYCLLDPFACHVLLDSFGTY 634

Qy 610 ALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQ 669 

MM:: I I : I I : : I : : I I I : I I : I I I I : : I I : I II I : MM:: 
Db 635 ALTGEP I TDCAVKQLKVAVFGCMS CNSLD YNLRVYCVDNT PCAFQEWS DERHQGGQLLE 694 

Qy 670 EPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVS 729 

II : : I I I I : : I : : I : I : I I I : I : I I : II : I : : I I I I : I I I : 
Db 695 EPKLLHFKGNTFSLQISVLDIPPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYT 754 

Qy 730 PST S DLACKLWVWQVEGDGQS FS INFN I TKDTRFAELLALES EAGVPALVGPSAFKI PFL 789 

I : I : I : M : : I : : I I : : I : I : : : II I I I I I I I : 

Db 755 PTTTQLSCKICIRQLKGHEQILQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPYS 814 

Qy 790 IRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLS 849 

II I : I : : I I : I I I : I I I I : : : I I : I I : : I I : I : I I I I I I I I I MM 

Db 815 IRQRICATFDTPNAKGKDWQMLAQKNSINRNLSYFATQSSPSAVI LNLWEARHQHDGDLD 874 

Qy 850 Q LAAAVAGLGQ P DAGL FT VS EAE 872 

I I I : : I : I : I I : : 
Db 875 SLACALEEIGRTHTKLSNISESQ 897 



RESULT 12 
US-09-969-532-32 

; Sequence 32, Application US/09969532
; Patent No. 6777232
; GENERAL INFORMATION:
; APPLICANT: Walke, D. Wade
; APPLICANT: Scoville, John
; TITLE OF INVENTION: Novel Human Membrane Proteins and Polynucleotides
Encoding the Same
; FILE REFERENCE: LEX-0244-USA
; CURRENT APPLICATION NUMBER: US/09/969,532
; CURRENT FILING DATE: 2001-10-02
; PRIOR APPLICATION NUMBER: US 60/237,280
; PRIOR FILING DATE: 2000-10-02
; NUMBER OF SEQ ID NOS: 33
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 32
; LENGTH: 655
; TYPE: PRT
; ORGANISM: homo sapiens
US-09-969-532-32 

Query Match 27.0%; Score 1259.5; DB 4; Length 655; 

Best Local Similarity 36.8%; Pred. No. 2.7e-110; 

Matches 262; Conservative 128; Mismatches 216; Indels 105; Gaps 11; 

Qy 197 VAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFC 256 

M II I I M II II I : II I : I I I M I II I I 
Db 1 MAANIVAKRRSLSATWVYVDGSWEVWSEWSVCSP 35 



Qy 257 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDT 316

: I I I II I : I II I II : I : I : : 
Db 36 ECEHLRIRECTAPPPRNGGKFCEGLSQES 64



Qy 317 RNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDYADSSILT 376 

III II: I : I I I II II | : | : : : : I I : I I II I I I 

Db 65 ENCTDGLC I LGI ENAS DI AL YS GL- GAAWAVAVLVI GVTL YRRSQ S DYGVDVI D S SALT 123 

Qy 377 SGFQPVSIKPSKADNPHLL— TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPL 434 

I I I : I : I I I : I I I I I : I I I : I I I I I : I I : I I 
Db 124 GGFQTFNFKTVRQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKELMTESSLFNPL 180 

Qy 435 G GGRHTLHHSSPTSEAEEFVSRLSTQN 461 

II: I :: :: II 
Db 181 SDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFSTIVIHPRNKM-PYIQNLS 236 

Qy 462 YFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRL 521 

III I III I I I I I :: I I I I : I I I I I III : | | | : : : : : | | 
Db 237 S LPTRTELRTTGVFGHLGGRLVMPNTGVS LLI PHGAI PEENSWEI YMS INQGEP- SL 292 

Qy 522 PLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHL 581 

I : I I I I I : I I I I : : : I I I : I I : I : I : : I I I : : : I I I : I : : 
Db 293 QSDGSEVLLS PEVTCGPPDMI VTTPFALTI PHCADVS SEHWNIHLKKRTQQGKWEEVMSV 352 

Qy 582 GEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNI 641 

: I : I III: I I : I : I : I I I I : : I I : I I : : I : : I I I : I I : 
Db 353 EDESTS — CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFGCMSCNSLDYNL 410 

Qy 642 RVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLV 701 

I I I I : : I I : I I I I : I I I I : : I I : : I I I I : : I : : I : I : I I I : I 
Db 411 RVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDIPPFLWRIKPFT 470 

Qy 702 SYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDT 761 

: I I : I I : I : : I I I I : I I I : I : I : I : I I : : I : : I I : : I : 
Db 471 ACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQILQVQTSILESE 530 

Qy 762 RFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHL 821 

I : : : II I I I I I I I : I I I : I : : I I : I I I : I I I I : : : I 

Db 531 RET I T FFAQEDST FPAQTGPKAFKI P YS I RQRI CAT FDT PNAKGKDWQMLAQKNS INRNL 590 

Qy 822 SFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872 

I : I I : : I I : I : I I I I I I I I I : I : I III: : I : I : I I : : 
Db 591 SYFATQSSPSAVI LNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISESQ 641 



RESULT 13 
US-09-969-532-30 

; Sequence 30, Application US/09969532
; Patent No. 6777232
; GENERAL INFORMATION:
; APPLICANT: Walke, D. Wade
; APPLICANT: Scoville, John
; TITLE OF INVENTION: Novel Human Membrane Proteins and Polynucleotides
Encoding the Same
; FILE REFERENCE: LEX-0244-USA
; CURRENT APPLICATION NUMBER: US/09/969,532
; CURRENT FILING DATE: 2001-10-02
; PRIOR APPLICATION NUMBER: US 60/237,280
; PRIOR FILING DATE: 2000-10-02
; NUMBER OF SEQ ID NOS: 33
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 30
; LENGTH: 666
; TYPE: PRT
; ORGANISM: homo sapiens
US-09-969-532-30 

Query Match 26.8%; Score 1251; DB 4; Length 666; 

Best Local Similarity 36.4%; Pred. No. 1.8e-109; 

Matches 263; Conservative 128; Mismatches 215; Indels 116; Gaps 12; 

Qy 197 VAKNIVARRRSASAAVIVYWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFC 256 

: I II I I : I II I I I : I II : I I I : I I I I I I 
Db 1 MAANIVAKRRSLSATVWYVDGSWEVWSEWSVCSP 35 

Qy 257 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDT 316 

: I I I I I I : I I I I I I : I : I : : 
Db 36 ECEHLRI RECTAPPPRNGGKFCEGLSQES 64 

Qy 317 RNCTSDLCV HSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGL 365 

Mill: I I : I I I I I | | | : | : : : : | I : 

Db 65 ENCTDGLCILDKKPLHEIKPQSIENASDIALYSGL-GAAWAVAVLVIGVTLYRRSQSDY 123 

Qy 366 DSDVADSSILTSGFQPVSIKPSKADNPHLL— TIQPDLSTTTTTYQGSLCPRQDGPSPKF 423 

I I I I I I I I I I : I : I II : I II I I : I I I : I III I 
Db 124 GVDVIDSSALTGGFQTFNFKTVRQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKE 180 

Qy 424 QLTNGHLLS PLG GGRHTLHHSSPTSEA 450 

: I I : I I | | : | : : 

Db 181 IJyiTESSLFNPLSDIKVWQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFSTMHPRNm 240 

Qy 451 EEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI Y 510 

: : II II I I III I I I I I :: I I I I : I I I I I III : I I I 

Db 241 -PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIY 293 

Qy 511 LTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQS 570 

:::::! I I : I I I I I : I I I I : : : I I I : I I : I : I : : I I I : : 
Db 294 MSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRT 352 

Qy 571 CEGSWEDVXHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFA 630 

: I I I : I : : : I : I III: I I : I : I : I I I I : : I I : I I : : I 
Db 353 QQGKWEEVMSVEDESTS — CYCLLDPFACHVXLDSFGTYALTGEPITDCAVKQLKVAVFG 410 

Qy 631 PVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDV 690 

: : I I I : I I : I I I I : : I I : I I I I : I I I I : : I I : : I I I I : : I : : I : I : 
Db 411 CMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDI 470 

Qy 691 PSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQS 750 

I 11:1 : I I : I I : I : : I I I I : I I I : I : I : I : I I : : I : : I I 
Db 471 PPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQI 530 

Qy 751 FSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRT 810 

: : I : I : : : II I I I I I I I : I I I : I :: I I : I I I : 

Db 531 LQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNAKGKDWQM 590 

Qy 811 LAQ KLH LD S H L S FFAS K P S P TAMI LN LWEARH F PN GN L S QLAAAVAGL GQ P DAGL FT VS E 870 

I I I I : : : | | : | | : : I I : I : I I I I I I I I I : I : I III: : I : I : I I 



Db 591 LAQKNSINRNLSYFATQSSPSAVILNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISE 650

Qy 871 AE 872

Db 651 SQ 652



RESULT 14 
US-09-969-532-28 

; Sequence 28, Application US/09969532
; Patent No. 6777232
; GENERAL INFORMATION:
; APPLICANT: Walke, D. Wade
; APPLICANT: Scoville, John
; TITLE OF INVENTION: Novel Human Membrane Proteins and Polynucleotides
Encoding the Same
; FILE REFERENCE: LEX-0244-USA
; CURRENT APPLICATION NUMBER: US/09/969,532
; CURRENT FILING DATE: 2001-10-02
; PRIOR APPLICATION NUMBER: US 60/237,280
; PRIOR FILING DATE: 2000-10-02
; NUMBER OF SEQ ID NOS: 33
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 28
; LENGTH: 669
; TYPE: PRT
; ORGANISM: homo sapiens
US-09-969-532-28 

Query Match 26.7%; Score 1242.5; DB 4; Length 669; 

Best Local Similarity 36.1%; Pred. No. 1.2e-108; 

Matches 262; Conservative 128; Mismatches 216; Indels 119; Gaps 12; 

Qy 197 VAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFC 256 

: I lllhlll II I : I I I : I I I : I I I I I I 
Db 1 MAANIVAKRRSLSATVWYVDGSWEVWSEWSVCSP 35 

Qy 257 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDT 316 

: I I I I I I : I I I I I I : I : I :: 
Db 36 ECEHLRI RECTAPPPRNGGKFCEGLSQES 64 

Qy 317 RNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILT 376 

III II: hlllll II I : I : : : : I I : I I I I I I I 

Db 65 ENCTDGLCILGIENASDIALYSGL-GAAWAVAVLVIGVTLYRRSQSDYGVDVIDSSALT 123 

Qy 377 SGFQPVSIKPSKA DNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPS 420 

I I I : I : III : I I I I I : I I I : I I I I 

Db 124 GGFQTFNFKTVRQAKNIMELMIQEKSFGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PL 180 

Qy 421 PKFQLTNGHLLSPLG GGRHTLHHSSPT 447 

I : I I : I I I I : I 

Db 181 DKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFSTMHPR 240 

Qy 448 SEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIY 507

: : : : II III I III I I I I I : : I I I I : I I I I I III : 

Db 241 NKM-PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLI PHGAI PEENSW 293 



Qy 508 EIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLK 567 

III::::: I I I : I I I I I : I I I I : : : I I I : I I : I : I :: I I 
Db 294 EIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLK 352 



Qy 568 KQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLL 627 

I : : : I I I : I : : : I : I III: I I : I : I : I I I I :: I I : I I : 
Db 353 KRTQQGKWEEVMSVEDESTS — CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVA 410 

Qy 628 LFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSI 687 

: I : : I I I : I I : I I I I : : I I : I I I I : I I I I : : I I : : I I I I : : I : : I : 
Db 411 VFGCMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISV 470 

Qy 688 HDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGD 747 

I : I I I : I : I I : I I : I : : I I I I : I I I : I : I : I : I I : : I : : I 
Db 471 LDIPPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGH 530 

Qy 748 GQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGAD 807 

I : : I : I : :: II I I I I I I I : I I I : I : : I I : I I 

Db 531 EQILQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNAKGKD 590 

Qy 808 WRTLAQ KLH LD S HL S FFAS K P S PTAMI LN LWEARH F PN GNLS Q LAAAVAGL GQ P DAGL FT 867 

I : I I II : : : I I : I I :: I I : I : I I I I I I I I I : I : I III: : I : I 
Db 591 WQMLAQKNSINRNLSYFATQSSPSAVI LNLWEARHQHDGDLDSLACALEEIGRTHTKLSN 650 

Qy 868 VSEAE 872 

: I I : : 

Db 651 ISESQ 655 



RESULT 15 
US-09-969-532-26 

; Sequence 26, Application US/09969532
; Patent No. 6777232
; GENERAL INFORMATION:
; APPLICANT: Walke, D. Wade
; APPLICANT: Scoville, John
; TITLE OF INVENTION: Novel Human Membrane Proteins and Polynucleotides
Encoding the Same
; FILE REFERENCE: LEX-0244-USA
; CURRENT APPLICATION NUMBER: US/09/969,532
; CURRENT FILING DATE: 2001-10-02
; PRIOR APPLICATION NUMBER: US 60/237,280
; PRIOR FILING DATE: 2000-10-02
; NUMBER OF SEQ ID NOS: 33
; SOFTWARE: FastSEQ for Windows Version 4.0
; SEQ ID NO 26
; LENGTH: 680
; TYPE: PRT
; ORGANISM: homo sapiens
US-09-969-532-26 

Query Match 26.5%; Score 1234; DB 4; Length 680; 

Best Local Similarity 35.7%; Pred. No. 7.7e-108; 

Matches 263; Conservative 128; Mismatches 215; Indels 130; Gaps 13; 



Qy 197 VAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFC 256

: I I I I I : I I I II I : I I I : I I I : I I I I I I 
Db 1 MAANIVAKRRSLSATWVYVDGSWEWSEWSVCSP 35



Qy 257 EGQNVQKTACATLCPVDGSWS PWS KWSACGLDCTHWRS RECS DPAPRNGGEECQGTDLDT 316 

: I I I I I I : I I I I I I : I : I : : 
Db 36 ECEHLRI RECTAPPPRNGGKFCEGLSQES 64 

Qy 317 RNCTSDLCV HSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGL 365 

III II: I II II I :|:: : :| I : 

Db 65 ENCTDGLCILDKKPLHEIKPQSIENASDIALYSGL-GAAVVAVAVLVTGVTLYRRSQSDY 123 

Qy 366 DSDVADSSILTSGFQPVSIKPSKA DNPHLL — TIQPDLSTTTTTYQ 409 

I I I I I I I I I I : I : III : I I I I I : I I 

Db 124 GVDVIDSSALTGGFQTFNFKTVRQAKNIMELMIQEKSFGNSLLLNSAMQPDL-TVSRTYS 182 

Qy 410 GSLCPRQDGPSPKFQLTNGHLLSPLG G 436 

I : I I I I I : I I : I I I 
Db 183 GPIC-LQD-PLDKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHG 240 

Qy 437 GRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLI 496 

I : I : : : : II III I III I I I I I : : I I I I : I I I I 

Db 241 NNHSFSTMHPRNKM-PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLI 293 

Qy 497 PPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGE 556 

I III : I I I : : : : : I I I : I I I I I : I I I I : : : I I I : I I : 
Db 294 PHGAIPEENSWEIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCAD 352 

Qy 557 PSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEAL 616 

I : I : : I I I : : : I I I : I : : : I : I III: I I : I : I : I I II : 
Db 353 VSSEHWNIHLKKRTQQGKWEEVMSVEDESTS— CYCLLDPFACHVLLDSFGTYALTGEPI 410 

Qy 617 SVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHF 676 

: I I : I I : : I : : I I I : I I : I I I I : : I I : I I I I : I I I I : : I I : : I I I 
Db 411 TDCAVKQLKVAVFGCMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHF 470 

Qy 677 KDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLA 736 

I : : I : : I : I : I I I : I : I I : I I : I : : I I I I : I I I : I : I : I : 
Db 471 KGNTFSLQISVXDIPPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLS 530 

Qy 737 CKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIIS 796 

I I : : I : : I I : : I : I : : : II I I I I I I I : I I I : I : 

Db 531 CKICIRQLKGHEQILQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICA 590 

Qy 797 S LDP PCRRGADWRTLAQKLHLDSHLS FFAS KPS PTAMI LNLWEARHFPNGNLSQLAAAVA 856 

: I I : I II : I I I I : : : I I : I I : : I I : I : I I I I I I I I I : I : I III: 
Db 591 TFDTPNAKGKDWQMLAQKNSINRNLSYFATQSSPSAVI LNLWEARHQHDGDLDSLACALE 650 

Qy 857 GLGQPDAGLFTVSEAE 872 

: I : I : I I : : 
Db 651 EIGRTHTKLSNISESQ 666 



Search completed: March 1, 2005, 09:05:50 
Job time : 56.3087 secs
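
The per-result statistics lines in this listing (Query Match, Best Local Similarity, Matches, and so on) can be cross-checked with a few lines of Python. The sketch below is not part of the GenCore output. The helper names are hypothetical, and the two ratios it computes are assumptions inferred from the numbers printed in this listing, not taken from GenCore documentation: Query Match is taken as Score divided by Perfect score, and Best Local Similarity as Matches divided by the sum of Matches, Conservative, Mismatches and Indels.

```python
PERFECT_SCORE = 4660.0  # "Perfect score" printed in the search header


def parse_stats(block):
    """Parse 'Name value;' items from a result's statistics lines into floats."""
    fields = {}
    for item in block.replace("\n", " ").split(";"):
        item = item.strip()
        if not item:
            continue
        # The value is the last whitespace-separated token; the rest is the name.
        name, value = item.rsplit(None, 1)
        fields[name] = float(value.rstrip("%"))
    return fields


def query_match(fields, perfect_score=PERFECT_SCORE):
    """Assumed relation: Query Match % = 100 * Score / Perfect score."""
    return 100.0 * fields["Score"] / perfect_score


def local_similarity(fields):
    """Assumed relation: identical columns over all aligned columns, in percent."""
    columns = (fields["Matches"] + fields["Conservative"]
               + fields["Mismatches"] + fields["Indels"])
    return 100.0 * fields["Matches"] / columns


# Statistics lines of RESULT 12 as printed in this listing.
RESULT_12 = """
Query Match 27.0%; Score 1259.5; DB 4; Length 655;
Best Local Similarity 36.8%; Pred. No. 2.7e-110;
Matches 262; Conservative 128; Mismatches 216; Indels 105; Gaps 11;
"""

stats = parse_stats(RESULT_12)
print(round(query_match(stats), 1), round(local_similarity(stats), 1))
```

Comparing the two computed values with the printed Query Match and Best Local Similarity fields is a quick way to spot a digit misread during text extraction. Pred. No. is not recomputed here because the listing does not say how it is derived beyond "analysis of the total score distribution".
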



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model

Run on:         March 1, 2005, 08:46:18 ; Search time 34.0799 Seconds
                (without alignments)
                2464.715 Million cell updates/sec

Title:          US-10-624-932-2_COPY_26_898
Perfect score:  4660
Sequence:       1 QQSATVANPVPGANPDLLPH AVAGLGQPDAGLFTVSEAEC 873

Scoring table:  BLOSUM62
                Gapop 10.0 , Gapext 0.5

Searched:       283416 seqs, 96216763 residues

Total number of hits satisfying chosen parameters: 283416

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries

Database :      PIR_79:*
                pir1:*
                pir2:*
                pir3:*
                pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
ALIGNMENTS



RESULT 1 
T32541 

unc-5 protein - Caenorhabditis elegans 
C;Species: Caenorhabditis elegans
C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 09-Jul-2004
C;Accession: T32541
R;Latreille, P.
submitted to the EMBL Data Library, December 1997
A;Description: The sequence of C. elegans cosmid B0273.
A;Reference number: Z21187
A;Accession: T32541
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-919 <LAT>
A;Cross-references: UNIPROT:O44171; EMBL:AF036698; PIDN:AAB88355.1;
GSPDB:GN00022; CESP:B0273.4a
A;Experimental source: strain Bristol N2; clone B0273
C;Genetics:
A;Gene: unc-5; CESP:B0273.4a
A;Map position: 4
A;Introns: 41/3; 108/1; 142/3; 201/1; 323/2; 553/1; 858/3
C;Superfamily: unc-5 protein; immunoglobulin homology; SH3 homology;
thrombospondin type 1 repeat homology

Query Match 21.0%; Score 977; DB 2; Length 919;

Best Local Similarity 28.7%; Pred. No. 1.1e-62;

Matches 265; Conservative 168; Mismatches 379; Indels 110; Gaps 31; 

Qy 24 EPEDWIVKNKPVXLVCKAVPATQIFFKCNGEWVRQVDHVIER — STDGSSGLPTMEVRI 81 

: I : I : : : I I I : I I : I I I : I : I I : : I : I I I : I I : I I : : : : 
Db 9 QPKSGYVIRNKPLRLQCRANHATKIRYKCSSKWID — DSRIEKLIGTDSTSGVGYIDASV 66 

Qy 82 NVS RQQVEKVFGLEEYWCQCVAWS S SG TTKSQKAYIRIARLRKNFEQEPLAKEVS 136 

:: I I I : :: : I I I | | | | | : : | : | | : | : | : | : | 

Db 67 DISRIDVDTSGHVDAFQCQCYA SGDDDQDWAS DVATVHLAYMRKH FLKS PVAQRVQ 123 

Qy 137 LEQGIVXPCRPPEGIPPAEVEWLRNEDLVT)PSLDP>r\rfITREH^ 196 

: I I I : I I I I I : I : : : I I III ' II— I I I : I : I I I I 
Db 124 EGTTLQLPCQAPESDPKAELTWYKDGVWQP— DANVIRASDGSLIMSAARLSDSGNYTC 181 

Qy 197 VAKNIVARRRSASAAVIVYVNGGWSTWTEW-SVCSASCG RGWQKR 240 

II: I : : I : I I : I I I I I : I I I I : I 

Db 182 EATWANSRKTDPVEVQIYVDGGWSEWSPWIGTCHVDCPLLRQHAHRIRDPHDVLPHQRR 241 

Qy 241 SRSCTNPAPLNGGAFCEGQWQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300 

: I : I I I I I I I I : I : I : : I I : I I I I I I I I I I I : I : I I : I 

Db 242 TRTCNNPAPLNDGEYCKGEEEMTRSCKVPCKLDGGWSSWSDWSACSSSCHRYRTRACTVP 301 

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASG — PEDVALYVGLIAVAVCLVLLLLVLILVY 358 

I I I I : I I I I I : I : I I : I I hi : : : : : I : I : 

Db 302 PPMNGGQPCFGDDLMTQECPAQLCTADS SRI VI SDTAVYGSVAS I FI VAS FILAI LAMFC 361 

Qy 359 CR KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTI 397 

I • I " • • I * I • • I • • • * I • 

Db 362 CKRGNSKKSKPLKPQKMNSEKAGGIYYS EPPGVRRLLLEHQHGTLLGEKISSCSQYF 418 

Qy 398 -QPDLSTTTT TYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPT-SE 449 

I I : I I : I II ^ I I I I I I I I : 

Db 419 EPPPLPHSTTLRSGKSAFSGYSSTRNAGSRAALIQECSSSSSGSGGKRTMLRTSSSNCSD 478 

Qy 450 AEEFVSRLSTQNYFRSLPRGTS-NMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YE 508 

: : : I III: : I I I : : I I : : I I : I : 

Db 479 DDNYATLYDYMEDKSVLGLDTSQNIVAAQIDSNGARLSLSKSGARLIVPELAVEGEKM — 536 

Qy 509 IYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGV LLTRPVI LAMDHCGEPS P-D 560 

: I I : : I : : I I I : : I I Mill::: II | | 

Db 537 LYLAVSDTLTDQPHLKPIESALSPVIVIGQCDVSMSAHDNILRRPVWSFRHCASTFPRD 596 

Qy 561 SWSLRLKKQSCEGS-WEDVLHLGEEAPSHLYYCQLEASA CYVFTEQLGRFAL 611 

: I I : I I I I : : M I I : : I I I M I I I I 

Db 597 NWQFTL — YADEGSGWQKAWIGEENLNT^FVQFEQPGKKNDGFGWCHVMTYSLARLML 654 

Qy 612 VGEAL--SVAAAKRLKLLLFAPVACTSLE--YNIRVTCLHDTHDALKEWQLEKQLGGQL 667 

I I: Mil I: I M I :: : M III: M |:: I : I I M 

Db 655 AGHPRRNSLSAAKRVHLAVFGPTEMSAYRRPFELRVYCVPETGAAMESVWKQED— GSRL 712 



Qy 668 IQEPR--VLHFKDSYHNLRLSIHDV-PSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFT 724



Db 713 LCESNDFILNEKG NLCICIEDVIPGFSCDGPEWEISETQHRFV AQNGLHCSLK 766 

Qy 725 LERVSPSTSDIACKLWWQVEGDGQSESINFNITKDTRFAELLALESEAGVPALVGPSAF 784 

. I ... I . I . . . . . ••Ill • I 

• i * * • i * I ■ • ••• • • | | | • I 

Db 767 FRPKEINGSQFSTRVIVYQKASSTEPMVM — EVSNEPELYDATSEEREKGSVCV EF 820 

Qy 785 KIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKP — SPTAMILNLWEARH 842 

::|| :: :: III : I I I I I : I I I I : I I I I I I lll:::|:|||| 
Db 821 RLPFGVKDELARLLDMPNESHSDWRGLAKKLHYDRYLQFFASFPDCSPTSLLLDLWEASS 880 

Qy 843 FPNGN-LSQLAAAVAGLGQPDA 863 

: : I : : I : I I I 
Db 881 SGSARAVPDLLQTLRVMGRPDA 902 



RESULT 2 
B44294 

unc-5 protein, long form - Caenorhabditis elegans 
N;Contains: unc-5 protein, short form
C;Species: Caenorhabditis elegans
C;Date: 30-Apr-1993 #sequence_revision 28-Jul-1995 #text_change 09-Jul-2004
C;Accession: B44294; T32540; A44294
R;Leung-Hagesteijn, C.; Spence, A.M.; Stern, B.D.; Zhou, Y.; Su, M.W.;
Hedgecock, E.M.; Culotti, J.G.
Cell 71, 289-299, 1992
A;Title: UNC-5, a transmembrane protein with immunoglobulin and thrombospondin
type 1 domains, guides cell and pioneer axon migrations in C. elegans.
A;Reference number: A44294; MUID:93046629; PMID:1384987
A;Contents: variety Bergerac
A;Accession: B44294
A;Molecule type: DNA
A;Residues: 1-947 <LEU>
A;Cross-references: UNIPROT:O44171; GB:S47168; NID:g258527; PIDN:AAB23867.1;
PID:g258529
A;Note: sequence extracted from NCBI backbone (NCBIN:116668, NCBIN:116670,
NCBIN:116672, NCBIN:116674, NCBIN:116676, NCBIN:116678, NCBIN:116680,
NCBIN:116682, NCBIN:116685, NCBIP:118648)
A;Note: authors translated the codon CTA for residue 642 as Val; sequence shown
follows the authors' translation
A;Note: mRNA lacking the first exon is equally prevalent
R;Latreille, P.
submitted to the EMBL Data Library, December 1997
A;Description: The sequence of C. elegans cosmid B0273.
A;Reference number: Z21187
A;Accession: T32540
A;Status: preliminary; translated from GB/EMBL/DDBJ
A;Molecule type: DNA
A;Residues: 1-947 <LAT>
A;Cross-references: EMBL:AF036698; PIDN:AAB88356.1; GSPDB:GN00022; CESP:B0273.4b
A;Experimental source: strain Bristol N2; clone B0273
C;Genetics:
A;Gene: unc-5
A;Map position: 4
A;Introns: 28/1; 69/3; 136/1; 170/3; 229/1; 351/2; 581/1; 886/3
C;Function:
A;Description: required for guidance of pioneering axons and cells migrating
dorsally along the body wall; proposed to be a receptor on the surface of the
motile cells
C;Superfamily: unc-5 protein; immunoglobulin homology; SH3 homology;
thrombospondin type 1 repeat homology
C;Keywords: alternative splicing; duplication; glycoprotein; receptor;
transmembrane protein
F;30-947/Product: unc-5 protein, short form #status predicted <ALT>
F;46-116/Domain: immunoglobulin homology <IM1>
F;153-211/Domain: immunoglobulin homology <IM2>
F;229-300/Domain: thrombospondin type 1 repeat homology #status atypical <THR1>
F;301-354/Domain: thrombospondin type 1 repeat homology <THR2>
F;365-390/Domain: transmembrane #status predicted <TMM>
F;512-559/Domain: SH3 homology <SH3>
F;53-114, 65-112, 160-209/Disulfide bonds: #status predicted
F;206/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 21.0%; Score 977; DB 1; Length 947;

Best Local Similarity 28.7%; Pred. No. 1.1e-62;

Matches 265; Conservative 168; Mismatches 379; Indels 110; Gaps 31; 

Qy 24 EPEDWIVTWKPVXLVCKAVPATQI^^ 81 

: I : I : : : I I I : I I : I I | : | : I | : : | : | I I : I I : I I : : : : ■ 
Db 37 QPKSGYVIRNKPLRLQCRANHATKIRYKCSSKWID — DSRIEKLIGTDSTSGVGYIDASV 94 

Qy 82 NVS RQQVEKVFGLEEYWCQCVAWS S SG TTKSQKAYIRIARLRKNFEQEPLAKEVS 136 

: : I I I : : : : I I I I II I I : : I : I I : I : I : I : I 

Db 95 DI S RI DVDT S GHVDAFQCQC YA S GD D DQ DWAS DVATVH LAYMRKH FLK S P VAQRVQ 151 

Qy 137 LEQGIVXPCRPPEGIPPAEV^WLRNEDLVTDPSLDPNWITREHSLWRQARI^TANYTC 196 

: I I I : I I I I I : I : : : I I III : I I :: I I I : I : I I I I 
Db 152 EGTTLQLPCQAPESDPKAELTWYKDGVWQP — DANVIRASDGSLIMSAARLSDSGNYTC 209 

Qy 197 VAKNIVARRRSASAAVIVYVNGGWSTWTEW-SVCSASCG RGWQKR 240 

II: I : : I : I I : I I I I I : I I I I : I 

Db 210 EATWANSRKTDPV^VQIYVT)GGWSEWSPWIGTCHVDCPLLRQHAHRIRDPHDVLPHQRR 269 

Qy 241 SRSCTNPAPLNGGAFCEGQ^QKTACATLCPVTlGSWSPWSKWSACGLDCTHWRSRECSDP 300 

: I : I I I I I I I I : I : I : : I I : I I I I II I I I I I : I : I I : I 

Db 270 TRTCNNPAPLNDGEYCKGEEEMTRSCKVPCKLDGGWSSWSDWSACSSSCHRYRTRACTVP 329 

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASG— PEDVALYVGLIAVAVCLVLLLLVLILVY 358 

I I I I : I I I I I : I : I I :| I I : I : :: : :| :| : 

Db 330 P PMNGGQPCFGDDLMTQECPAQLCTADS S RI VI S DTAVYGSVAS I FI VAS FI LAI LAMFC 389 

Qy 359 CR KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTI 397 

I : | : : : | : | : : | i : ::| : 

Db 390 CKRGNSKKSKPLKPQKMNSEKAGGIYYS EPPGVRRLLLEHQHGTLLGEKISSCSQYF 446 

Qy 398 -QPDLSTTTT T YQGSLCPRQDGPS PKFQLTNGHLLS PLGGGRHTLHHS S PT— SE 449 

I I : I I : I II I I I I I I I I : 

Db 447 EPPPLPHSTTLRSGKSAFSGYSSTRNAGSRAALIQECSSSSSGSGGKRTMLRTSSSNCSD 506 

Qy 450 AEEFVSRLSTQNYFRSLPRGTS-NMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYE 508 

: : : I III: : I I I : : I I : : I I : I : 

Db 507 DDNYATLYDYMEDKSVLGLDTSQNIVAAQIDSNGARLSLSKSGARLIVPELAVEGEKM-- 564 



Qy 509 IYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGV LLTRPVILAMDHCGEPSP-D 560 

:II : : I :: III:: I I :| III::: M II 

Db 565 LYLAVSDTLTDQPHLKPIESALSPVIVIGQCDVSMSAHDNILRRPVWSFRHCASTFPRD 624 

Qy 561 SWSLRLKKQSCEGS-WEDVLHLGEEAPSHLYYCQLEASA CYVFTEQLGRFAL 611 

: I I : I I I I : : : I I I : : I I I : I I I I I 

Db 625 NWQFTL — YADEGSGWQKAWIGEENLNTl^BVQFEQPGKKNDGFGWCHVMTYSLARLML 682 

Qy 612 VGEAL--SVAAAKRLKLLLFAPVACTSLE--YNIRVYCLHDTHDALKEWQLEKQLGGQL 667

I I : : I I I I : I : I I :: : : I I I I : : I I :: I : I I : I 

Db 683 AGHPRRNSLSAAKRVHLAVFGPTEMSAYRRPFELRVYCVPETGAAMESVWKQED— GSRL 740 

Qy 668 IQEPR — VLHFKDSYHNLRLSIHDV-PSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFT 724 

: I : I : I I I : I I I I : I I : I III: 
Db 741 LCESNDFILNEKG NLCICIEDVIPGFSCDGPEWEISETQHRFV AQNGLHCSLK 794

Qy 725 LERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAF 784 

• • • ••• • * i i i * \ 

i ... i . i * . ••• ••iii • i 

Db 795 FRPKEINGSQFSTRVIVYQKASSTEPMVM — EVSNEPELYDATSEEREKGSVCV EF 848 

Qy 785 KIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKP- — SPTAMI LNLWEARH 842 

: : I I : : : : II I : I I I I I : I I I I : I I I I I I I I I : : : I : I I I I 
Db 849 RLPFGWDEIARLLDMPNESHSDWRGLAKKLHYDRYLQFFASFPDCSPTSLLLDLWEASS 908 

Qy 843 FPNGN-LSQLAAAVAGLGQPDA 863 

: : I : : I : I I I 
Db 909 S GS ARAVP DLLQT LRVMGRP DA 930 



RESULT 3 
T00026 

brain-specific angiogenesis inhibitor 1 - human 
N;Alternate names: BAI1 protein
C;Species: Homo sapiens (man)
C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004
C;Accession: T00026
R;Nishimori, H.; Shiratsuchi, T.; Urano, T.; Kimura, Y.; Kiyono, K.; Tatsumi,
K.; Yoshida, S.; Ono, M.; Kuwano, M.; Nakamura, Y.
submitted to the EMBL Data Library, June 1997
A;Reference number: Z14064
A;Accession: T00026
A;Status: translated from GB/EMBL/DDBJ
A;Molecule type: mRNA
A;Residues: 1-1584 <NIS>
A;Cross-references: UNIPROT:O14514; EMBL:AB005297; NID:d1175078; PID:d1024528
A;Experimental source: brain
C;Genetics:
A;Gene: GDB:BAI1
A;Cross-references: GDB:9838088; OMIM:602682
A;Map position: 8q24-8q24
F;408-462/Domain: thrombospondin type 1 repeat homology <THR3>

Query Match 6.4%; Score 298.5; DB 2; Length 1584; 

Best Local Similarity 33.5%; Pred. No. 4.1e-13; 

Matches 78; Conservative 35; Mismatches 91; Indels 29; Gaps 11; 



Qy 99 CQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEW 158

I I : I I I : : I I : : I I I : I I I I I I I I I 
Db 309 CNREACGPAGRTSSRSQSLRSTDARR REELGDEL QQ FGFPA- PQTGD PAAE- EW 360



Qy 159 LRNEDLVDPSLDPWYITREHSLVVRQARLADTA^ 218 

: II : I I I : : : I : : : I : : : I : I 

Db 361 — SPWSVCSSTCGEGWQTR TRFCVSSSYSTQCSGPLREQRLCNNSAVCPVHG 410 

Qy 219 GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCP VDG 274 

I I : I I : I I : : I I I I :: I : I : I I II III I I III III 
Db 411 AWDEWSPWSLCSSTCGRGFRDRTRTCR — PPQFGGNPCEGPEKQTKFCNIALCPGRAVDG 468 

Qy 275 SWSPWSKWSACGLDCT HWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLC 324 

: I : I I I I I I I : I : I I I : I : I I I I I I : : I I : I I 

Db 469 NWNEWSSWSACSASCSQGRQQRTRECNGPS--YGGAECQGHWVETRDCFLQQC 519 



RESULT 4 
TSHUP2 

thrombospondin 2 precursor - human 
C;Species: Homo sapiens (man)
C;Date: 19-May-1995 #sequence_revision 03-Aug-1995 #text_change 09-Jul-2004
C;Accession: A47379; A42173
R;LaBell, T.L.; Byers, P.H.
Genomics 17, 225-229, 1993
A;Title: Sequence and characterization of the complete human thrombospondin 2
cDNA: potential regulatory role for the 3' untranslated region.
A;Reference number: A47379; MUID:94010892; PMID:8406456
A;Accession: A47379
A;Molecule type: mRNA
A;Residues: 1-1172 <LAB>
A;Cross-references: UNIPROT:P35442; GB:L12350; NID:g307505; PIDN:AAA03703.1;
PID:g307506
R;LaBell, T.L.; Milewicz, D.J.; Disteche, C.M.; Byers, P.H.
Genomics 12, 421-429, 1992
A;Title: Thrombospondin II: partial cDNA sequence, chromosome location, and
expression of a second member of the thrombospondin gene family in humans.
A;Reference number: A42173; MUID:92217961; PMID:1559694
A;Accession: A42173
A;Molecule type: mRNA
A;Residues: 560-1172 <LA2>
A;Cross-references: GB:M81339
A;Experimental source: fibroblast
A;Note: sequence extracted from NCBI backbone (NCBIN:95091, NCBIP:95096)
C;Genetics:
A;Gene: GDB:THBS2; TSP2
A;Cross-references: GDB:128789; OMIM:188061
A;Map position: 6q27-6q27
C;Complex: homotrimer, disulfide linked
C;Function:
A;Description: participates in cell migration and adhesion, and in platelet
aggregation
C;Superfamily: thrombospondin 1; EGF homology; thrombospondin type 1 repeat
homology; von Willebrand factor type C repeat homology
C;Keywords: beta-hydroxyasparagine; calcium binding; cell adhesion;
glycoprotein; trimer
F;1-18/Domain: signal sequence #status predicted <SIG>
F;19-1172/Product: thrombospondin 2 #status predicted <MAT>
F;319-377/Domain: von Willebrand factor type C repeat homology <VWC>
F;380-431/Domain: thrombospondin type 1 repeat homology <THR1>
F;436-492/Domain: thrombospondin type 1 repeat homology <THR2>
F;493-549/Domain: thrombospondin type 1 repeat homology <THR3>
F;553-588/Domain: EGF homology <EGF1>
F;652-691/Domain: EGF homology <EGF>
F;928-930/Region: cell attachment (R-G-D) motif
F;151, 316, 330, 457, 584, 710, 1069/Binding site: carbohydrate (Asn) (covalent)
#status predicted
F;167-226/Disulfide bonds: #status predicted
F;266,270/Disulfide bonds: interchain #status predicted
F;612/Modified site: erythro-beta-hydroxyasparagine (Asn) #status predicted

Query Match 6.4%; Score 296.5; DB 1; Length 1172; 

Best Local Similarity 30.5%; Pred. No. 3.8e-13; 

Matches 78; Conservative 28; Mismatches 105; Indels 45; Gaps 9; 

ARLADTANYTCVAKNIVARRRSASAA-VIVTWGGWSTWTEWSVCSAS 242 
I I : I I : : I I II : : I I I I I : I I I I : I I I II 



I :| I II hi : II I I : I I I I I I I I I I I : I hi I 



I I : I I : I I : : I I III II 

1PEPQYGGKACVGDVQERQMCNKRSC PVDGCLSNPCFPGAQC 564 



Qy 184
Db 403
Qy 243
Db 463
Qy 299
Db 523
Qy 359
Db 565
Qy 410
Db 616



)SDVADSSILTSGFQPVSI — KPSKADNPHLLTIQPDLSTTTT TYQ 409 

I I I : I I I I : : : : I I : : I : I 

-SSFPDGS-WSCGFCPVGFLGNGTHCEDLDECALVPDICFSTSKVPRCWTQP 615 



II I I I I 



RESULT 5 
JC5928 

semaphorin F precursor - human 
C;Species: Homo sapiens (man)
C;Date: 10-Apr-1998 #sequence_revision 08-May-1998 #text_change 09-Jul-2004
C;Accession: JC5928
R;Simmons, A.D.; Pueschel, A.W.; McPherson, J.D.; Overhauser, J.; Lovett, M.
Biochem. Biophys. Res. Commun. 242, 685-691, 1998
A;Title: Molecular cloning and mapping of human semaphorin F from the
Cri-du-chat candidate interval.
A;Reference number: JC5928; MUID:98125554; PMID:9464278
A;Accession: JC5928
A;Status: nucleic acid sequence not shown
A;Molecule type: mRNA
A;Residues: 1-1074 <SIM>
A;Cross-references: UNIPROT:Q13591; GB:U52840; NID:g2772583; PIDN:AAC09473.1;
PID:g2772584
A;Experimental source: brain
C;Comment: This protein disrupts normal brain development and leads to some of
the features of Cri-du-chat.
C;Genetics:
A;Gene: semaf
C;Superfamily: human semaphorin F; thrombospondin type 1 repeat homology
F;1-20/Domain: signal sequence #status predicted <SIG>
F;50-533/Domain: semaphorin #status predicted <SEM>
F;840-896/Domain: thrombospondin type 1 repeat homology <THR3>
F;971-993/Domain: transmembrane #status predicted <TMM>

Query Match 6.3%; Score 293; DB 2; Length 1074;

Best Local Similarity 45.8%; Pred. No. 6.1e-13;

Matches 54; Conservative 11; Mismatches 49; Indels 4; Gaps 2; 

Qy 216 VN GGWS TWT EW SVC S AS C GRGWQKRS RS CTN PAP LN GGAFC EGQNVQKTACAT L- C PVD G 274 

I It I I I I I I t I 111:111111 II II::: I I I I I I I I 
Db 783 VNGAWSAWTSWSQCSRDCSRGIRNRKRVCNNPEPKYGGMPCLGPSLEYQECNTLPCPVDG 842 

Qy 275 SWSPWSKWSACGLDC THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSAS 329 

I I II I : I I : I : I I I : I I I I I : I I : I : I II 
Db 843 WSCWSPWTKCSATCGGGHYMRTRSCSNPAPAYGGDICLGLHTEEALCNTQPCPESWS 900 



RESULT 6 
A42587 

thrombospondin 2 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Mar-1993 #sequence_revision 18-Nov-1994 #text_change 09-Jul-2004 
C;Accession: A42587; A39851 

R;Laherty, C.D.; O'Rourke, K.; Wolf, F.W.; Katz, R.; Seldin, M.F.; Dixit, V.M.
J. Biol. Chem. 267, 3274-3281, 1992 

A; Title: Characterization of mouse thrombospondin 2 sequence and expression 
during cell growth and development. 

A;Reference number: A42587; MUID:92147683; PMID:1371115
A;Accession: A42587

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: nucleic acid 
A; Residues: 1-1172 <LAH> 

A;Cross-references: UNIPROT:Q03350; GB:L07803; GB:M87275; NID:g340421;
PIDN:AAA53064.1; PID:g567241

A;Note: sequence extracted from NCBI backbone (NCBIP:81502)

R;Bornstein, P.; O'Rourke, K.; Wikstrom, K.; Wolf, F.W.; Katz, R.; Li, P.;
Dixit, V.M.

J. Biol. Chem. 266, 12821-12824, 1991 

A;Title: A second, expressed thrombospondin gene (Thbs2) exists in the mouse
genome.

A;Reference number: A39851; MUID:91302287; PMID:1712771
A;Accession: A39851
A; Status: preliminary 
A; Molecule type: mRNA 
A; Residues: 1-873 <BOR> 

A;Cross-references: GB:M64866; NID:g201994; PIDN:AAA40432.1; PID:g201995
C;Superfamily: thrombospondin 1; EGF homology; thrombospondin type 1 repeat
homology; von Willebrand factor type C repeat homology
C; Keywords: calcium binding; glycoprotein 

F;319-377/Domain: von Willebrand factor type C repeat homology <VWC>
F;380-431/Domain: thrombospondin type 1 repeat homology <THR1>
F;436-492/Domain: thrombospondin type 1 repeat homology <THR2>
F;493-549/Domain: thrombospondin type 1 repeat homology <THR3>
F;553-588/Domain: EGF homology <EGF1>
F;652-691/Domain: EGF homology <EGF>

Query Match 6.3%; Score 293; DB 2; Length 1172; 

Best Local Similarity 38.0%; Pred. No. 6.9e-13; 

Matches 60; Conservative 22; Mismatches 66; Indels 10; Gaps 5; 
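The "Query Match" percentage in each result block is consistent with the hit score divided by the perfect score of 4660 given in the search header. This relationship is inferred from the printed values rather than taken from GenCore documentation; the short sketch below, with an illustrative function name, reproduces the reported figures.

```python
PERFECT_SCORE = 4660  # "Perfect score" from the search header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Hit score as a percentage of the query's self-alignment score."""
    return 100.0 * score / perfect_score


# RESULT 5 and RESULT 6 both report Score 293.
print(round(query_match_percent(293), 1))  # 6.3
```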



Qy 184 RQARLADTAN YTCVAKN I VARRRS - ASAAVI VYVNGGWSTWT EWS VC S AS CGRGWQKRS R 242 

:: I I : II: :| I I : I I I I I I : I I I I : I I I II 

Db 403 QRGRSCDVTSNTCLGPSIQTRTCSLGKCDTRIRQNGGWSHWSPWSSCSVTCGVGNVTRIR 462 

Qy 243 SCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDCT HWRSRECS 298 

I : I I II I : I : I I I : I I T I I I I I I I I : I INI: 
Db 463 LCNS PVPQMGGKNCKGSGRETKPCQRDPCPIDGRWS PWS PWSACTVTCAGGI RERSRVCN 522 

Qy 299 DPAPRNGGEECQG — TD LDTRNCTSDLCVHSASGP 331 

I I : I I : : I I I : : I : I I I : : I 
Db 523 SPEPQYGGKDCVGDVTEHQMCNKRSCPIDGCLSNPCFP 560 



RESULT 7 
T18856 

angiogenesis inhibitor homolog - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 09-Jul-2004 
C;Accession: T18856; T24653 
R;McMurray, A. 

submitted to the EMBL Data Library, July 1995 
A;Reference number: Z19031 
A;Accession: T18856 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-1444 <WIL> 

A;Cross-references: UNIPROT:Q8MYA8; EMBL:Z50004; PIDN:CAA90293.1; GSPDB:GN00028;
CESP:C02B4.1

A; Experimental source: clone C02B4 
R;McMurray, A. 

submitted to the EMBL Data Library, July 1995 
A; Reference number: Z19917 
A;Accession: T24653 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A;Residues: 1-1444 <WI2>

A;Cross-references: EMBL:Z50006; PIDN:CAA90302.1; GSPDB:GN00028; CESP:C02B4.1

A; Experimental source: clone T07C5 

C; Genetics : 

A; Gene: CESP:C02B4.1 

A; Map position: X 

A;Introns: 25/3; 70/3; 96/3; 139/3; 187/1; 234/2; 282/3; 376/2; 422/2; 478/3; 
509/3; 566/2; 625/1; 696/2; 786/3; 812/2; 878/3; 971/1; 1007/3; 1067/1; 1099/3; 
1180/3; 1273/2; 1305/1; 1363/1; 1388/2 



Query Match 5.9%; Score 276; DB 2; Length 1444; 

Best Local Similarity 27.2%; Pred. No. 1.6e-11;

Matches 73; Conservative 28; Mismatches 97; Indels 70; Gaps 12; 



Qy 98 WCQCVAWSSSGTTKSQKAYIRIARLRKNFEQ EPLAKEVSLEQGIVLPCRPPEGI 151 

I : : I I : : : I : I I : III : I I I 

Db 1134 WSEWSSWSAC SCFSLTSTRRRFCQWDPTVQGFCAGAILEQ IPCAPGSCS 1183 

Qy 152 PPAE VEW LRNEDLVDPSLDPNVYITREHSLWRQARLADTAN 193 

II II : | | : I : | | : 

Db 1184 PSAGGWSLWSEWSSCSKDCGDTGHQIRNRMCSEP IPSNRGAYCSG 1228 

Qy 194 YT CVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 248 

I : II I : : : I : I I I : I I I I I : I I : I : I I I I 

Db 1229 YSFDQRPCVMDNVCSDEK VDGGWTDWTAWSECTDYCRNGHRSRTRFCANPK 1279 

Qy 249 PLNGGAFCEGQNVQKTAC — ATLCPV-DGSWSPWSKWSACGLDC THWRS REC S D PAP 302 

I Mill:: I I : I I I I I I I : I I I I I I I I 

Db 1280 PSQGGAQCTGSDFELNPCFDPARCHLRDGGWSTWSDWTPCSASCGFGVQTRDRSCSSPEP 1339 

Qy 303 RNGGEECQGTDLDTRNCTSDLCVHSASG 330 

: I I : I I I I 11:1 

Db 1340 K-GGQSCSGLAHQTSLCDLPACDHESDG 1366 



RESULT 8 
T00326 

hypothetical protein KIAA0550 - human 
C; Species: Homo sapiens (man) 

C;Date: 01-Feb-1999 #sequence_revision 01-Feb-1999 #text_change 15-Mar-2004
C; Accession: T00326 

R;Nagase, T.; Ishikawa, K. ; Miyajima, N.; Tanaka, A.; Kotani, H. ; Nomura, N.; 
Ohara, O. 

DNA Res. 5, 31-39, 1998 

A;Title: Prediction of the coding sequences of unidentified human genes. IX. The 
complete sequences of 100 new cDNA clones from brain which can code for large 
proteins in vitro. 

A; Reference number: Z14086; MUID : 98290545; PMID: 9628581 
A; Accession: T00326 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-984 <NAG> 

A;Cross-references: EMBL:AB011122; NID:g3043623; PIDN:BAA25476.1; PID:g3043624

A; Experimental source: brain 

C; Genetics : 

A; Note: KIAA0550 

F;344-398/Domain: thrombospondin type 1 repeat homology <THR3> 

Query Match 5.9%; Score 275; DB 2; Length 984; 

Best Local Similarity 39.0%; Pred. No. 1.1e-11;

Matches 57; Conservative 20; Mismatches 53; Indels 16; Gaps 6; 

Qy 195 TCVA KNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 248 

III: : I : : hi I I : I I : I I : I I I I : I : I I I I 

Db 317 TCVSPYGTHCSGPLRESRVCNNTALCPVHGVWEEWSPWSLCSFTCGRGQRTRTRSCT--P 374

Qy 249 PLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC THWRSRECSDPAPRN 304

 I II III I I I I I I I I I I I I I : I I I I I : I : I :

Db 375 PQYGGRPCEGPETHHKPCNIALCPVDGQWQEWSSWSQCSVTCSNGTQQRSRQCT--AAAH 432



Qy 305 GGEECQGTDLDTRNCTSDLCVHSASG 330 

II 11:1 ::| | : | :|:| 
Db 433 GGSECRGPWAESRECYNPEC--TANG 456



RESULT 9 
T00028 

brain-specific angiogenesis inhibitor 3 - human 
N; Alternate names: BAI3 protein 
C; Species: Homo sapiens (man) 

C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004 
C;Accession: T00028 

R; Shiratsuchi, T.; Nishimori, H.; Ichise, H.; Nakamura, Y. ; Tokino, T. 
Cytogenet. Cell Genet. 79, 103-108, 1997 

A;Title: Cloning and characterization of BAI2 and BAI3, novel genes homologous
to brain-specific angiogenesis inhibitor 1 (BAI1).

A; Reference number: Z14066; MUID: 98194217 ; PMID: 9533023 

A; Accession: T00028 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA
A; Residues: 1-1522 <SHI> 

A;Cross-references: UNIPROT:O60242; EMBL:AB005299; NID:g3021700;
PIDN:BAA25363.1; PID:g3021701

A; Experimental source: brain 

C; Genetics: 

A;Gene: GDB:BAI3
A;Cross-references: GDB:9838090; OMIM:602684
A;Map position: 6q12-6q12

F; 344-398/Domain: thrombospondin type 1 repeat homology <THR3> 

Query Match 5.9%; Score 275; DB 2; Length 1522; 

Best Local Similarity 39.0%; Pred. No. 2e-11;

Matches 57; Conservative 20; Mismatches 53; Indels 16; Gaps 6; 

Qy 195 TCVA KNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 248 

III: ' : I : : hi I I : I I : I I : I I I I : I : I I I I 

Db 317 TCVSPYGTHCSGPLRESRVCNNTALCPVHGVWEEWSPWSLCSFTCGRGQRTRTRSCT--P 374

Qy 249 PLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC THWRSRECSDPAPRN 304 

I II III I I I I I I I I I I I I I : I I I I I : I : I : 

Db 375 PQYGGRPCEGPETHHKPCNIALCPVDGQWQEWSSWSQCSVTCSNGTQQRSRQCT-- AAAH 432 



Qy 305 GGEECQGTDLDTRNCTSDLCVHSASG 330 

Mil:! ::| I : I :|:| 
Db 433 GGSECRGPWAESRECYNPEC--TANG 456



RESULT 10 
T00027 

brain-specific angiogenesis inhibitor 2 - human 
N; Alternate names: BAI2 protein 
C; Species: Homo sapiens (man) 

C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004 
C;Accession: T00027

R; Shiratsuchi, T.; Nishimori, H.; Ichise, H.; Nakamura, Y. ; Tokino, T. 
Cytogenet. Cell Genet. 79, 103-108, 1997 



A;Title: Cloning and characterization of BAI2 and BAI3, novel genes homologous
to brain-specific angiogenesis inhibitor 1 (BAI1).

A; Reference number: Z14066; MUID: 98194217 ; PMID: 9533023 

A;Accession: T00027 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-1572 <SHI> 

A;Cross-references: UNIPROT:O60241; EMBL:AB005298; NID:g3021698;
PIDN:BAA25362.1; PID:g3021699

A; Experimental source: brain 

C; Genetics : 

A;Gene: GDB:BAI2
A;Cross-references: GDB:9838089; OMIM:602683
A;Map position: 1p35-1p35

Query Match 5.9%; Score 274.5; DB 2; Length 1572; 

Best Local Similarity 19.2%; Pred. No. 2.2e-11;

Matches 176; Conservative 108; Mismatches 307; Indels 327; Gaps 38; 

Qy 148 PEGI PPAEVEWLRNEDLVDPSLDPNVY 1 T REH S L WRQARL 188 

I I I : : I I : I : I : I : I I I I I 

Db 271 PEEEPKVKTQWPRSAD EPGLYMAQTGDPAAEEWSPWSVCSLTCGQGLQVR-TRS 323 

Qy 189 ADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 248

: : I : : I : : I : I I I I I : I I I I I I I : I I : I
Db 324 CVSSPYGTLCSGPLRETRPCNNSATCPVHGVWEEWGSWSLCSRSCGRGSRSRMRTCV--P 381

Qy 249 PLNGGAFCEGQNVQKTACA-TLCPVDGSWSPWSKWSACGLDC THWRSRECSDPAPR- 303 

I : I I II I : I I : I I I : I I I I I I I I I I : I I I 
Db 382 PQHGGKACEGPELQTKLCSMAACPVEGQWLEWGPWGPCSTSCANGTQQRSRKCSVAGPAW 441 

Qy 304 NGGEECQ 310 

t I : 

Db 442 ATCTGALTDTRECSNLECPATDSKWGPWNAWSLCSKTCDTGWQRRFRMCQATGTQGYPCE 501 

Qy 311 GTDLDTRNCTSDLC — VHSASGPEDVAL 336 

I I : : I : I I III 
Db 502 GTGEEVKPCSEKRCPAFHEMCRDEYVMLMTWKKAAAGEIIYNKCPPNASGSASRRCLLSA 561 



Qy 337 YVGL I AVAVC L VLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKA 389 

I I I : I I : I : : : I : : : I : : I I : : : 

Db 562 QGVAYWGLPS FARCI SHEYRYL YLS LREHLAKGQRMLAGEGMSQVVRS - LQELLARRT YY 620 

Qy 390 DNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLT NGHLLSPLGG 436 

I :: : | | : : | | ||: : :lt I 

Db 621 SGDLLFSVDILRNVTDTFKRATYVPSADDVQRFFQWSFMVDAENKEKWDDAQQVSP--G 678 

Qy 437 GRHTLHHSSPTSEAEEFV SRLSTQNYFRSLPRG TSNMTYGTFN 479 

I I I : I : I : I I I : I : I : : I : 

Db 679 SVHLLR WEDFIHLVGDALKAFQSSLIVTDNLVISIQREPVSAVSSDITFPMRG 732 

Qy 480 FLG GRLMI PNTGI SLLI P PDAIPRGK 505 

I I I : I : I I I I : I I 

Db 733 RRGMKDWVRHSEDRLFLPKEVLSLSSPGKPATSGAAGSPGRGRGPGTVPPGPGHSHQRLL 792 



Qy 506 IYE-IYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVIL 549 

:| : I I I I III ::: |: II |:| 



Db 793 PADPDESSYFVIGAVLYRTLGLILPPP RPPLAVTSRVMT--VTVRPPTQPPAEPLIT 847



Qy 550 A MDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQ-LEASACYV — 601 

:: :| II : : I |: Mill: 

Db 84 8 VELSYIINGTTDPHCASWDYS-RADASSGDWD TENCQTLETQAAHTRC 894 

Qy 602 FT EQLGRFALVGE AL S VAAAKRL KL L L FAP VACT S L E YN I RVYC LH DT HDALKEV 656 

till::: I :| : : |:: |:| :| : :| I 

Db 895 QCQHLSTFAVLAQPPKDLTLELAGSPSVPLVIGCAVSCMALLTLLAIYA AFWRF 948 



Qy 657 VQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQ 716 

:: I: : I III : |:: ■ :|| : : I 
Db 949 IKSERSI ILLNFCLSI — LASNI LILVGQSRVLSKGVCTMTA 988 

Qy 717 RYLHCTFTLERVSPSTSDLACKLWV WQVEGDG 748 

: I I I I : I I I : 

Db 989 AFLHFFF LSSFCWVLTEAWQSYLAVIGRMRTRLVRKRFLCLGWGLPALV 1037 



Qy 749 QSFSINFNITKDTRFAELLALESEAG-VPALVGPSA FKIPFLIRQKI IS 796 

: I : I I I : I I I : I I I I : I I : : I : II 

Db 1038 VAVS VGFTRTKGYGTS S YCWLS LEGGLL YAFVGPAAVI VLVNMLI GI I VFNKLMARDGI S 1097 



Qy 797 SLDPPCRRGAD WRTL 811 

II:: I : I 
Db 1098 DKSKKQRAGSERCPWASL 1115 



RESULT 11 
A40558 

thrombospondin 1 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 05-Jun-1992 #sequence_revision 05-Jun-1992 #text_change 09-Jul-2004 
C;Accession: A40558; A37905; B42587; S68787

R;Lawler, J.; Duquette, M. ; Ferro, P.; Copeland, N.G.; Gilbert, D.J.; Jenkins, 
N.A. 

Genomics 11, 587-600, 1991 

A; Title: Characterization of the murine thrombospondin gene. 

A; Reference number: A40558; MUID : 92128941 ; PMID: 1774063 

A; Accession: A40558 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-1170 <LAW> 

A;Cross-references: UNIPROT:P35441; GB:M62449; GB:M62450; GB:M62451; GB:M62452;
GB:M62453; GB:M62454; GB:M62455; GB:M62456; GB:M62457; GB:M62458; GB:M62459;
GB:M62460; GB:M62461; GB:M62462; GB:M62463; GB:M62464; GB:M62465; GB:M62466;
GB:M62467; GB:M62468; GB:M62469; GB:M62470; NID:g511867; PIDN:AAA50611.1;
PID:g511869

R;Bornstein, P.; Alfi, D. ; Devarayalu, S.; Framson, P.; Li, P. 
J. Biol. Chem. 265, 16691-16698, 1990 

A; Title: Characterization of the mouse thrombospondin gene and evaluation of the 

role of the first intron in human gene expression. 

A;Reference number: A37905; MUID : 90375546; PMID:2398070 

A; Accession: A37905 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-490 <BOR> 



A;Cross-references: GB:J05605; GB:J05606; NID:g201991; PIDN : AAA40431 . 1; 
PID:g554390 

R;Laherty, C.D.; O'Rourke, K.; Wolf, F.W.; Katz, R.; Seldin, M.F.; Dixit, V.M.
J. Biol. Chem. 267, 3274-3281, 1992 

A;Title: Characterization of mouse thrombospondin 2 sequence and expression 
during cell growth and development. 

A; Reference number: A42587; MUID: 92147683 ; PMID: 1371115 
A;Accession: B42587 

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: mRNA 

A;Residues: 1-1152,'P',1154-1170 <LAH>
A;Cross-references: GB:M87276

A;Note: sequence extracted from NCBI backbone (NCBIP:81501)
R;Chen, H.; Aeschlimann, D.; Nowlen, J.; Mosher, D.F. 
FEBS Lett. 387, 36-41, 1996 

A;Title: Expression and initial characterization of recombinant mouse 
thrombospondin 1 and thrombospondin 3. 

A;Reference number: S68787; MUID : 96234006; PMID: 8654563 

A; Accession: S68787 

A;Molecule type: protein 

A;Residues: 19-26,'X',28-37 <CHE>

C; Complex: homotrimer, disulfide linked 

C;Superfamily: thrombospondin 1; EGF homology; thrombospondin type 1 repeat
homology; von Willebrand factor type C repeat homology

C; Keywords: calcium binding; glycoprotein; homotrimer 

F; 1-18/Domain: signal sequence #status predicted <SIG> 

F;19-1170/Product: thrombospondin 1 #status predicted <MAT> 

F;317-375/Domain: von Willebrand factor type C repeat homology <VWC> 

F; 378-429/Domain: thrombospondin type 1 repeat homology <THR1> 

F;434-490/Domain: thrombospondin type 1 repeat homology <THR2>

F; 491-547/Domain: thrombospondin type 1 repeat homology <THR3> 

F;551-586/Domain: EGF homology <EGF> 

F;248,360,708,1067/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 5.8%; Score 270.5; DB 2; Length 1170; 

Best Local Similarity 32.2%; Pred. No. 3e-11;

Matches 57; Conservative 24; Mismatches 71; Indels 25; Gaps 5; 

Qy 182 WRQARLADT AN YT C VAKN I VAR RRSASAAVIVYVNGGWSTWTEWSVCSASC 233 

: : : I I : I I : : I : I : I I I I I : I I I I : I 

Db 399 IQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQ DGGWSHWSPWSSCSVTC 451 

Qy 234 GRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC 289 

II I I I : I : I I III: : II I I : : I I I I I I I : I 

Db 452 GDGVTTRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGG 511 

Qy 290 THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVC 346 

I I I I : : I I : I I : : I I : : I I II I III 

Db 512 VQRRSRLCNNPTPQFGGKDCVGDVTENQVCNKQDC PIDGCLSNPCFAGAKC 562 



RESULT 12 
TSHUP1 

thrombospondin 1 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 23-Aug-1987 #sequence_revision 03-Aug-1995 #text_change 09-Jul-2004 
C;Accession: A26155; A34274; A30140; A25812; A05172; A42927 



R;Lawler, J.; Hynes, R.O. 

J. Cell Biol. 103, 1635-1648, 1986

A; Title: The structure of human thrombospondin, an adhesive glycoprotein with 
multiple calcium-binding sites and homologies with several different proteins. 
A; Reference number: A26155; MUID : 87057617 ; PMID:2430973 
A;Accession: A26155 
A;Molecule type: mRNA 
A; Residues: 1-1170 <LAW> 

A;Cross-references: UNIPROT : P07996; GB:X04665; NID:g37137; PIDN : CAA28370 . 1 ; 
PID:g37138 

A;Note: parts of this sequence, including the amino end of the mature protein, 
were determined by protein sequencing 
R;Laherty, CD.; Gierman, T.M.; Dixit, V.M. 
J. Biol. Chem. 264, 11222-11227, 1989 

A; Title: Characterization of the promoter region of the human thrombospondin 

gene. DNA sequences within the first intron increase transcription. 

A; Reference number: A34274; MUID: 89291870 ; PMID:2544587 

A;Accession: A34274 

A;Molecule type: DNA 

A; Residues: 1-166 <LAH> 

A;Cross-references: GB:J04835

R;Hennessy, S.W.; Frazier, B.A.; Kim, D.D.; Deckwerth, T.L.; Baumgartel, D.M.; 

Rotwein, P.; Frazier, W.A. 

J. Cell Biol. 108, 729-736, 1989 

A; Title: Complete thrombospondin mRNA sequence includes potential regulatory 
sites in the 3' untranslated region.

A; Reference number: A30140; MUID: 89139590 ; PMID:2918029 
A;Accession: A30140 
A; Molecule type: mRNA 

A;Residues: 1-83,'A',85-522,'A',524-1170 <HEN>

A; Cross-references: EMBL:X14787; NID:g37464; PIDN : CAA32889 . 1 ; PID:g37465 

A; Note: parts of this sequence, including the amino end of the mature protein, 

were determined by protein sequencing 

R;Kobayashi, S.; Eden-McCutchan, F. ; Framson, P.; Bornstein, P. 
Biochemistry 25, 8418-8425, 1986 

A; Title: Partial amino acid sequence of human thrombospondin as determined by 
analysis of cDNA clones: homology to malarial circumsporozoite proteins. 
A;Reference number: A25812; MUID: 87157592 ; PMID:3030396 
A;Accession: A25812 
A;Molecule type: mRNA 
A;Residues: 1-83,'A',85-397 <KOB>

A;Cross-references: GB:M25631; NID:g538353; PIDN: AAA36741 . 1 ; PID:g538354 

R;Dixit, V.M. ; Hennessy, S.W.; Grant, G.A. ; Rotwein, P.; Frazier, W.A. 

Proc. Natl. Acad. Sci. U.S.A. 83, 5449-5453, 1986 

A; Reference number: A05172; MUID: 86287276; PMID: 3461443 

A;Accession: A05172 

A;Molecule type: mRNA 

A;Residues: 1-83,'A',85-374,'RC' <DIX>

A;Cross-references: GB:M14326; NID:g340005; PIDN:AAA61237.1; PID:g553801
A;Note: parts of this sequence, including the amino end of the mature protein, 
were determined by protein sequencing 
R;Sun, X.; Skorstengaard, K. ; Mosher, D.F. 
J. Cell Biol. 118, 693-701, 1992 

A; Title: Disulfides modulate RGD-inhibitable cell adhesive activity of 
thrombospondin . 

A; Reference number: A42927; MUID: 92348511 ; PMID: 1379247 
A; Accession: A42927 



A;Molecule type: protein 
A; Residues: 987-1003 <SUN> 

A;Note: Cys-992 is shown to have a free sulfhydryl 
C;Genetics : 

A; Gene: GDB: THBS1; TSP1; TSP 
A;Cross-references: GDB: 120438; OMIM: 188060 
A;Map position: 15q15-15q15
A;Introns: 23/1 

A; Note: the list of introns may be incomplete 
C;Complex: homotrimer, disulfide linked
C; Function: 

A; Description: participates in cell migration and adhesion, and in platelet 
aggregation 

C;Superfamily: thrombospondin 1; EGF homology; thrombospondin type 1 repeat
homology; von Willebrand factor type C repeat homology 
C;Keywords: beta-hydroxyasparagine; calcium binding; cell adhesion; 
glycoprotein; trimer 

F;1-18/Domain: signal sequence #status predicted <SIG>
F;19-1170/Product: thrombospondin 1 #status predicted <MAT>
F;317-375/Domain: von Willebrand factor type C repeat homology <VWC>
F;378-429/Domain: thrombospondin type 1 repeat homology <THR1>
F;434-490/Domain: thrombospondin type 1 repeat homology <THR2>
F;491-547/Domain: thrombospondin type 1 repeat homology <THR3>
F;551-586/Domain: EGF homology <EGF1>
F;650-689/Domain: EGF homology <EGF2>
F;926-928/Region: cell attachment (R-G-D) motif
F;171-232/Disulfide bonds: #status predicted
F;248,360,708,1067/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;270,274/Disulfide bonds: interchain #status predicted
F;610/Modified site: erythro-beta-hydroxyasparagine (Asn) #status predicted
F;1051/Binding site: carbohydrate (Asn) (covalent) #status absent



Query Match 5.8%; Score 268.5; DB 1; Length 1170; 

Best Local Similarity 32.9%; Pred. No. 4.1e-11;

Matches 51; Conservative 24; Mismatches 61; Indels 19; Gaps 4;



Qy 182 WRQARLADTAN YTCVAKN I VAR RRSASAAVIVYVNGGWSTWTEWSVCSASC 233

: : : I I : I I :: I : I : I I I I I : I I I I : I

Db 399 IQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQ DGGWSHWSPWSSCSVTC 451

Qy 234 GRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC 289

II I I I : I : I I III: : II I I : : I I I I I I I : I
Db 452 GDGVITRI RLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGG 511

Qy 290 THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLC 324

I I I I :: I I : I I : : I I . .: : I I
Db 512 VQKRSRLCNNPTPQFGGKDCVGDVTENQICNKQDC 546



RESULT 13 
A39804 

thrombospondin precursor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 10-Sep-1999 #sequence_revision 10-Sep-1999 #text_change 09-Jul-2004
C;Accession: A39804 

R;Lawler, J.; Duquette, M. ; Ferro, P. 
J. Biol. Chem. 266, 8039-8043, 1991 



A; Title: Cloning and sequencing of chicken thrombospondin. 
A; Reference number: A39804; MUID : 91217026; PMID:2022631 
A; Accession: A39804 
A; Status : preliminary 
A; Molecule type: mRNA 
A; Residues: 1-1178 <LAW> 

A; Cross-references: UNIPROT: P35440; GB:M60853; NID:g212763; PIDN : AAA51437 . 1 ; 
PID:g212764 

C; Super family: thrombospondin 1; EGF homology; thrombospondin type 1 repeat 

homology; von Willebrand factor type C repeat homology 

F; 325-383/Domain: von Willebrand factor type C repeat homology <VWC> 

F; 386-437/Domain: thrombospondin type 1 repeat homology <THR1> 

F; 442-498/Domain: thrombospondin type 1 repeat homology <THR2> 

F; 499-555/Domain: thrombospondin type 1 repeat homology <THR3> 

F; 658-697/Domain: EGF homology <EGF> 

Query Match 5.6%; Score 263; DB 1; Length 1178; 

Best Local Similarity 36.2%; Pred. No. 1e-10;

Matches 58; Conservative 16; Mismatches 70; Indels 16; Gaps 5; 

Qy 185 QARLADT AN YTCVAKN I VARRRS - AS AAVI VYVNGGW S T WT EW SVC S AS C GRGWQ KRS RS 243 

: I I I rill :: I I I I I : I I I I : I I I II 

Db 410 RGRSCDVTRSACTGPHIQTRMCS FKKCDHRI RQDGGWSHWS PWS SCSVTCGVGNITRIRL 469 

Qy 244 CTNPAPLNGGAFCEGQNVQKTACATL-CPVDGSWSPWSKWSACGLDC THWRSRECSD 299 

I : I I II II : I I I I : I I II I I I II : I Mil: 

Db 470 CNSPIPQMGGKNCVGNGRETEKCEKAPCPVNGQWGPWSPWSACTVTCGGGIRERSRLCNS 529 

Qy 300 PAPRNGGEECQGTDLDT RNCTSDLCVHSASGP 331 

I I : I I : I I II | : | | | : : I 

Db 530 PEPQYGGKPCVG DTKQHDMCNKRDCPIDGCLSNPCFP 566 



RESULT 14 
S29126 

properdin precursor [validated] - human 
N;Alternate names: factor P 
C; Species: Homo sapiens (man) 

C;Date: 17-Nov-2000 #sequence_revision 17-Nov-2000 #text_change 09-Jul-2004 
C;Accession: S29126; S16150; A05319; T45112; T45113

R;Nolan, K.F.; Kaluz, S.; Higgins, J.M.G.; Goundis, D.; Reid, K.B.M. 
Biochem. J. 287, 291-297, 1992 

A; Title: Characterization of the human properdin gene. 

A; Reference number: S29126; MUID: 93038568 ; PMID: 1417780 

A; Accession: S29126 

A; Molecule type: DNA 

A;Residues: 1-469 <NOL1>

A; Cross-references: UNIPROT : P27918 ; EMBL:X70872; NID:g35679; PIDN : CAA50220 . 1; 
PID:g35680 

R;Nolan, K.F.; Schwaeble, W. ; Kaluz, S.; Dierich, M.P.; Reid, K.B.M. 
Eur. J. Immunol. 21, 771-776, 1991 

A; Title: Molecular cloning of the cDNA coding for properdin, a positive 
regulator of the alternative pathway of human complement. 
A;Reference number: S16150; MUID: 91184288 ; PMID: 2009915 
A;Accession: S16150
A; Molecule type: mRNA 

A;Residues: 1-456,'R',458-469 <NOL2>



A; Cross-references : EMBL:X57748 

R;Reid, K.B.M.; Gagnon, J. 

Mol. Immunol. 18, 949-959, 1981 

A; Reference number: A05319; MUID: 82195224 ; PMID:7341961 

A; Accession: A05319 

A; Molecule type: protein 

A; Residues: 28-53, 'Q', 55-59, f G',61, • 1 1 , 63; 137-138 , 'P', 140-141, 'P»,143- 

144, 'X', 146-148, 'Y',150, , S , ,152, 'Y', 154-156, ' XSXGXA' ; 162-163, 'E',165- 

172, , X f , 174-176, , X , ,178, 'V , 180; 223-228 , 'X», 230-232, * GX * , 235-238, 'GH',241- 

245; 248-251, , X* ,253-257, f P ! ,259, »G' ,261, • XPP ' , 265-266, ' X 268-269; 280- 

285, 'X', 287-290, , X , ,292, 'H', 294-300, 1 SXXX 1 , 305-307 , , X I , 309-315, »K»,317;333- 

341,343-357, 'X', 359-362, 1 EXE 1 ; 393-404, 1 QK' , 407; 421-427, ' R' , 429-443, »TKV , 447- 

448, 'XX', 451, 'RX', 454-455 <REI> 

R;Westberg, J.; Nordin-Fredrikson, G.; Truedsson, L. ; Sjoholm, A.G.; Uhlen, M. 
submitted to the EMBL Data Library, May 1997 
A;Reference number: Z22914 
A;Accession: T45112 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 

A;Residues: 1-54,'X',56-73,'X',75-99,'W',101-469 <WES1>
A; Cross-references: EMBL: AF005665 ; PIDN : AAB63280 . 1 

A; Experimental source: genomic DNA from individual with properdin deficiency 
type II 

A; Accession: T45113 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 

A;Residues: 1-60,'X',62-413,'D',415-452,'XX',455-469 <WE2>
A;Cross-references: EMBL: AF005666; PIDN : AAC51626 . 1 

A; Experimental source: genomic DNA from individual with properdin deficiency 
type III 

R;Hartmann, S.; Hofsteenge, J. 

J. Biol. Chem. 275, 28569-28574, 2000 

A; Title: Properdin, the positive regulator of complement, is highly C- 
mannosylated. 

A; Reference number: A59360; MUID : 20435812 ; PMID : 10878002 
A; Contents : annotation 

A; Note: identification and location of C-mannosylation sites by mass- 

spectroscopy 

C; Genetics : 

A; Gene: GDB : PFC 

A; Cross-references: GDB: 120275; OMIM: 312060 
A;Map position: Xp11.3-Xp11.23

A;Introns: 26/1; 76/2; 135/1; 192/1; 256/1; 314/1; 378/1; 415/2 
C;Complex: a mixture of homodimers, homotrimers and homotetramers
C; Function: 

A; Description : protects C3 convertase (C3bBb) from rapid inactivation 
A; Pathway: complement alternate pathway 

C;Superfamily: human properdin precursor; thrombospondin type 1 repeat homology
C;Keywords: complement alternate pathway; glycoprotein; homodimer; homotetramer;
homotrimer; plasma

F; 1-27/Domain: signal sequence #status predicted <SIG> 

F; 28-469/Product : properdin #status experimental <MAT> 

F;76-128/Domain: thrombospondin type 1 repeat homology <THR1>
F;135-191/Domain: thrombospondin type 1 repeat homology <THR2>
F;192-255/Domain: thrombospondin type 1 repeat homology <THR3>
F;256-313/Domain: thrombospondin type 1 repeat homology <THR4>
F;314-377/Domain: thrombospondin type 1 repeat homology <THR5>
F;378-440/Domain: thrombospondin type 1 repeat homology <THR6>
F;83,86,139,142,145,196,199,260,263,321,324,382,385,388/Modified site:
2'-mannosyl-tryptophan (Trp) #status experimental
F;428/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 5.2%; Score 243; DB 1; Length 469; 

Best Local Similarity 39.5%; Pred. No. 8.7e-10; 

Matches 45; Conservative 14; Mismatches 43; Indels 12; Gaps 4; 

Qy 218 GGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACAT--LCPVDGS 275

I I II I I II : I : I : I I : I : I I I II III : III : I I I : 
Db 137 GGWSGWGPWEPCSVTCSKGTRTRRRACNHPAPKCGG-HCPGQAQESEACDTQQVCPTHGA 195 

Qy 276 WSPWSKWSACGLDC THWRS REC S D PAP — RNGGEECQGTDLDTRNCT 320 

I : I I : I I | | | : | | | | : | : | | : | | | 

Db 196 WATWGPWTPCSASCHGGPHEPKETRSRKCSAPEPSQKPPGKPCPGLAYEQRRCT 249 



RESULT 15 
S05478

properdin - mouse (fragment) 

C; Species: Mus musculus (house mouse) 

C;Date: 07-Sep-1990 #sequence_revision 07-Sep-1990 #text_change 09-Jul-2004 
C;Accession: S05478
R;Goundis, D.; Reid, K.B.M. 
Nature 335, 82-85, 1988 

A; Title: Properdin, the terminal complement components, thrombospondin and the 
circumsporozoite protein of malaria parasites contain similar sequence motifs. 
A;Reference number: S05478; MUID: 88318954 ; PMID:3045564 
A;Accession: S05478 
A;Molecule type: mRNA 
A; Residues: 1-437 <GOU> 

A;Cross-references: UNIPROT : P11680 ; EMBL:X12905; NID:g53786; PIDN : CAA31389 . 1 ; 
PID:g53787 

C;Complex: a mixture of homodimers, homotrimers and homotetramers
C; Function: 

A; Description: protects C3 convertase (C3bBb) from rapid inactivation 
A; Pathway: complement alternate pathway 

C;Superfamily: human properdin precursor; thrombospondin type 1 repeat homology
C;Keywords: complement alternate pathway; glycoprotein; homodimer; homotetramer;
homotrimer; plasma

F; 45-97/Domain: thrombospondin type 1 repeat homology <THR1> 

F; 104-160/Domain: thrombospondin type 1 repeat homology <THR2> 

F; 161-224/Domain: thrombospondin type 1 repeat homology <THR3> 

F;225-282/Domain: thrombospondin type 1 repeat homology <THR4>
F;283-345/Domain: thrombospondin type 1 repeat homology <THR5>
F;346-408/Domain: thrombospondin type 1 repeat homology <THR6>
F;52,55,108,111,114,165,168,229,232,290,293,350,353,356/Modified site:
2'-mannosyl-tryptophan (Trp) #status predicted
F;366,396/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 4.9%; Score 229; DB 2; Length 437; 

Best Local Similarity 40.4%; Pred. No. 8.3e-09; 

Matches 46; Conservative 10; Mismatches 46; Indels 12; Gaps 4; 



Qy 218 GGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACAT--LCPVDGS 275
Mill I I I : I : I I I I I I I I I II ||: I III | | I :



Db 106 GGWSEWGPWGPCSVTCSKGTQIRQRVCDNP7VPKCGG-HCPGEAQQSQACDTQKTCPTHGA 164 



Qy 276 WSPWSKWSACGLDC THWRSRECSDPAPRN — GGEECQGTDLDTRNCT 320 

I : I I I I I I I I I I I I : I : I I : : I : 

Db 165 WASWGPWSPRSGSCLGGAQEPKETRSRSCSAPAPSHQPPGKPCSGPAYEHKACS 218 



Search completed: March 1, 2005, 09:07:18 
Job time : 36.0799 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: March 1, 2005, 09:06:01 ; Search time 168.022 Seconds
(without alignments)
1704.439 Million cell updates/sec



Title: US-10-624-932-2_COPY_26_898 
Perfect score: 4660 

Sequence: 1 QQSATVANPVPGANPDLLPH AVAGLGQPDAGLFTVSEAEC 873 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1385339 seqs, 328044528 residues

Total number of hits satisfying chosen parameters: 1385339

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
Maximum Match 100%
Listing first 45 summaries
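The "Million cell updates/sec" figure in the header is consistent with the query length multiplied by the number of database residues searched, divided by the search time. This is an inference from the printed numbers, not a documented GenCore definition. The sketch below assumes a query length of 873, taken from the end position on the "Sequence:" line; the function name is illustrative.

```python
def million_cell_updates_per_sec(query_len, db_residues, seconds):
    """Dynamic-programming cells filled per second, in millions (assumed definition)."""
    return query_len * db_residues / seconds / 1e6


# Values from this search header: 873-residue query, 328044528 residues, 168.022 s.
rate = million_cell_updates_per_sec(873, 328044528, 168.022)
print(f"{rate:.1f}")
```

The result agrees with the reported 1704.439 to within the rounding of the printed search time.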



Database : Published_Applications_AA:*

1: /cgn2_6/ptodata/1/pubpaa/US07_PUBCOMB.pep:*
2: /cgn2_6/ptodata/1/pubpaa/PCT_NEW_PUB.pep:*
3: /cgn2_6/ptodata/1/pubpaa/US06_NEW_PUB.pep:*
4: /cgn2_6/ptodata/1/pubpaa/US06_PUBCOMB.pep:*
5: /cgn2_6/ptodata/1/pubpaa/US07_NEW_PUB.pep:*
6: /cgn2_6/ptodata/1/pubpaa/PCTUS_PUBCOMB.pep:*
7: /cgn2_6/ptodata/1/pubpaa/US08_NEW_PUB.pep:*
8: /cgn2_6/ptodata/1/pubpaa/US08_PUBCOMB.pep:*
9: /cgn2_6/ptodata/1/pubpaa/US09A_PUBCOMB.pep:*
10: /cgn2_6/ptodata/1/pubpaa/US09B_PUBCOMB.pep:*
11: /cgn2_6/ptodata/1/pubpaa/US09C_PUBCOMB.pep:*
12: /cgn2_6/ptodata/1/pubpaa/US09_NEW_PUB.pep:*
13: /cgn2_6/ptodata/1/pubpaa/US10A_PUBCOMB.pep:*
14: /cgn2_6/ptodata/1/pubpaa/US10B_PUBCOMB.pep:*
15: /cgn2_6/ptodata/1/pubpaa/US10C_PUBCOMB.pep:*
16: /cgn2_6/ptodata/1/pubpaa/US10D_PUBCOMB.pep:*
17: /cgn2_6/ptodata/1/pubpaa/US10_NEW_PUB.pep:*
18: /cgn2_6/ptodata/1/pubpaa/US11_NEW_PUB.pep:*
19: /cgn2_6/ptodata/1/pubpaa/US60_NEW_PUB.pep:*
20: /cgn2_6/ptodata/1/pubpaa/US60_PUBCOMB.pep:*

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                     %
Result             Query
   No.    Score    Match  Length  DB  ID                  Description

     1     4660    100.0     898  10  US-09-918-779-2     Sequence 2, Appli
     2     4660    100.0     898  15  US-10-624-932-2     Sequence 2, Appli
     3   4567.5     98.0     899  10  US-09-970-944-2     Sequence 2, Appli
     4     4511     96.8     898  10  US-09-933-261-5     Sequence 5, Appli
     5     4511     96.8     898  10  US-09-970-944-13    Sequence 13, Appl
     6     4511     96.8     898  14  US-10-256-702-5     Sequence 5, Appli
     7     4511     96.8     898  14  US-10-240-154-16    Sequence 16, Appl
     8     4282     91.9     842  15  US-10-311-623-1     Sequence 1, Appli
     9     2845     61.1     544  10  US-09-970-944-14    Sequence 14, Appl
    10   2815.5     60.4     557  10  US-09-933-261-6     Sequence 6, Appli
    11   2815.5     60.4     557  14  US-10-256-702-6     Sequence 6, Appli
    12   2778.5     59.6     931  10  US-09-970-944-15    Sequence 15, Appl
    13   2778.5     59.6     931  11  US-09-972-211-121   Sequence 121, App
    14   2778.5     59.6     931  15  US-10-087-684-35    Sequence 35, Appl
    15   2778.5     59.6     931  15  US-10-037-417-117   Sequence 117, App
    16   2778.5     59.6     931  15  US-10-096-625-121   Sequence 121, App
    17   2778.5     59.6    1010  15  US-10-218-779-35    Sequence 35, Appl
    18   2754.5     59.1     931  10  US-09-970-944-16    Sequence 16, Appl
    19   2754.5     59.1     931  11  US-09-972-211-125   Sequence 125, App
    20   2754.5     59.1     931  15  US-10-096-625-125   Sequence 125, App
    21   2751.5     59.0     931  15  US-10-037-417-120   Sequence 120, App


22 


2747. 5 


59. 


0 


931 


10 


us- 


09- 


970- 


944- 


17 


Sequence 


17, Appl 


23 


2747. 5 


59. 


0 


931 


11 


us- 


09- 


972- 


211- 


122 


Sequence 


122, App 


24 


2747.5 


59. 


0 


931 


15 


us- 


10- 


087- 


684- 


36 


Sequence 


36, Appl 


25 


2747.5 


59. 


0 


931 


15 


us- 


10- 


218- 


779- 


36 


Sequence 


36, Appl 


26 


2747.5 


59. 


0 


931 


15 


us- 


10- 


037- 


417- 


118 


Sequence 


118, App 


27 


2747.5 


59. 


0 


931 


15 


us- 


10- 


037- 


417- 


119 


Sequence 


119, App 


28 


2747.5 


59. 


0 


931 


15 


us- 


10- 


096- 


625- 


122 


Sequence 


122, App 


29 


2560.5 


54. 


9 


943 


10 


us- 


09- 


933- 


261- 


7 


Sequence 


7, Appli 


30 


2560.5 


54. 


9 


943 


14 


us- 


10- 


256- 


702- 


7 


Sequence 


7, Appli 


31 


2560.5 


54. 


9 


945 


11 


us- 


09- 


972- 


211- 


124 


Sequence 


124, App 


32 


2560.5 


54. 


9 


945 


15 


us- 


10- 


087- 


684- 


34 


Sequence 


34, Appl 


33 


2560. 5 


54. 


9 


945 


15 


us- 


10- 


218- 


779- 


34 


Sequence 


34, Appl 


34 


2560.5 


54. 


9 


945 


15 


us- 


10- 


037- 


417- 


121 


Sequence 


121, App 


35 


2560. 5 


54. 


9 


945 


15 


us- 


10- 


096- 


625- 


124 


Sequence 


124, App 


36 


2557.5 


54. 


9 


945 


11 


us- 


09- 


972- 


211- 


123 


Sequence 


123, App 


37 


2557.5 


54. 


9 


945 


15 


us- 


10- 


087- 


684- 


33 


Sequence 


33, Appl 


38 


2557.5 


54. 


9 


945 


15 


us- 


10- 


218- 


779- 


33 


Sequence 


33, Appl 


39 


2557.5 


54. 


9 


945 


. 15 


us- 


10- 


096- 


625- 


123 


Sequence 


123, App 


40 


2546.5 


54. 


6 


933 


15 


us- 


10- 


087- 


684- 


2 


Sequence 


2, Appli 


41 


2546.5 


54. 


6 


933 


15 


us- 


10- 


087- 


684- 


4 


Sequence 


4, Appli 


42 


2546.5 


54. 


6 


933 


15 


us- 


10- 


218- 


779- 


2 


Sequence 


2, Appli 


43 


2546.5 


54. 


6 


933 


15 


us- 


10.-: 


218- 


779- 


4 


Sequence 


4, Appli 


44 


2546.5 


54. 


6 


945 


15 


us- 


10- 


037- 


417- 


38 


Sequence 


38, Appl 


45 


2541.5 


54. 


5 


945 


14 


us- 


10- 


028- 


072- 


146 


Sequence 


146, App 
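
Each row of the summary listing carries seven fields: result number, score, query match percentage, hit length, database index, sequence ID, and a truncated description. A minimal parsing sketch follows, assuming whitespace-separated columns as in this report; `SummaryRow` and `parse_summary_row` are hypothetical names, not part of GenCore.

```python
import re
from typing import NamedTuple, Optional


class SummaryRow(NamedTuple):
    result_no: int
    score: float
    query_match: float  # percent of the perfect score
    length: int         # length of the database sequence
    db: int             # index into the database file list
    seq_id: str
    description: str


# Five numeric columns, one ID token, then free-text description.
_ROW = re.compile(
    r"^\s*(\d+)\s+([\d.]+)\s+([\d.]+)\s+(\d+)\s+(\d+)\s+(\S+)\s+(.*\S)\s*$"
)


def parse_summary_row(line: str) -> Optional[SummaryRow]:
    """Return a SummaryRow, or None for header and blank lines."""
    m = _ROW.match(line)
    if m is None:
        return None
    no, score, pct, length, db, seq_id, desc = m.groups()
    return SummaryRow(int(no), float(score), float(pct),
                      int(length), int(db), seq_id, desc)


row = parse_summary_row("     3  4567.5    98.0     899  10  US-09-970-944-2      Sequence 2, Appli")
print(row)
```

The `db` field can then be used to look up the source file in the numbered database list near the top of the report.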



ALIGNMENTS 



RESULT 1 
US-09-918-779-2 

; Sequence 2, Application US/09918779 
; Publication No. US20030064369A1 



GENERAL INFORMATION: 
APPLICANT: Taupier, Raymond 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Rastelli, Luca 
APPLICANT: Spaderna, Steven 
APPLICANT: Shimkets, Richard 
APPLICANT: Zerhusen, Bryan 
APPLICANT: Spytek, Kimberly 
APPLICANT: Shenoy, Suresh 
APPLICANT: Li, Li 
APPLICANT: Gusev, Vladimir 
APPLICANT: Grosse, William
APPLICANT: Alsobrook, John 
APPLICANT: Lepley, Denise 
APPLICANT: Burgess, Catherine 
APPLICANT: Gerlach, Valerie 
APPLICANT: Ellerman, Karen 
APPLICANT: MacDougall, John 
APPLICANT: Stone, David 
APPLICANT: Smithson, Glennda 

TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-074 US 
CURRENT APPLICATION NUMBER: US/09/918,779 
CURRENT FILING DATE: 2001-07-30 
PRIOR APPLICATION NUMBER: 60/221,409 
PRIOR FILING DATE: 2000-07-28 
PRIOR APPLICATION NUMBER: 60/222,840 
PRIOR FILING DATE: 2000-08-04 
PRIOR APPLICATION NUMBER: 60/223,752 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,762 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,770 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,769 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/225,146 
PRIOR FILING DATE: 2000-08-14 
PRIOR APPLICATION NUMBER: 60/225,392 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,470 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,697 
PRIOR FILING DATE: 2000-08-16 
PRIOR APPLICATION NUMBER: 60/263,662 
PRIOR FILING DATE: 2001-02-01 
PRIOR APPLICATION NUMBER: 60/281,645 
PRIOR FILING DATE: 2001-04-05 
NUMBER OF SEQ ID NOS: 61
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 2 
LENGTH: 898 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-918-779-2 



Query Match 100.0%; Score 4660; DB 10; Length 898;

Best Local Similarity 100.0%; Pred. No. 0;

Matches 873; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 325

Qy  301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 385

Qy  361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 445

Qy  421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  446 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 505

Qy  481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 565

Qy  541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 625

Qy  601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  626 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 685

Qy  661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 745

Qy  721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  746 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 805

Qy  781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  806 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy  841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||||||||||||||||||||||||||||
Db  866 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898



RESULT 2 
US-10-624-932-2 

Sequence 2, Application US/10624932 
Publication No. US20040096877A1 
GENERAL INFORMATION: 



APPLICANT: Taupier, Raymond
APPLICANT: Padigaru, Muralidhara
APPLICANT: Rastelli, Luca
APPLICANT: Spaderna, Steven
APPLICANT: Shimkets, Richard
APPLICANT: Zerhusen, Bryan
APPLICANT: Spytek, Kimberly
APPLICANT: Shenoy, Suresh
APPLICANT: Li, Li
APPLICANT: Gusev, Vladimir
APPLICANT: Grosse, William
APPLICANT: Alsobrook, John
APPLICANT: Lepley, Denise
APPLICANT: Burgess, Catherine
APPLICANT: Gerlach, Valerie
APPLICANT: Ellerman, Karen
APPLICANT: MacDougall, John
APPLICANT: Stone, David
APPLICANT: Smithson, Glennda
TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-074 US 
CURRENT APPLICATION NUMBER: US/10/624,932 
CURRENT FILING DATE: 2003-07-21 
PRIOR APPLICATION NUMBER: 09/918,779 
PRIOR FILING DATE: 2001-07-03 
PRIOR APPLICATION NUMBER: 60/221,409 
PRIOR FILING DATE: 2000-07-28 
PRIOR APPLICATION NUMBER: 60/222,840 
PRIOR FILING DATE: 2000-08-04 
PRIOR APPLICATION NUMBER: 60/223,752 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,762
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,770 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,769 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/225,146 
PRIOR FILING DATE: 2000-08-14 
PRIOR APPLICATION NUMBER: 60/225,392 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,470 
PRIOR FILING DATE: 2000-08-15 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS: 61
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 2 
; LENGTH: 898 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-624-932-2 

Query Match 100.0%; Score 4660; DB 15; Length 898; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 873; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 325

Qy  301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 385

Qy  361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 445

Qy  421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  446 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 505

Qy  481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 565

Qy  541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 625

Qy  601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  626 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 685

Qy  661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 745

Qy  721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  746 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 805

Qy  781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  806 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy  841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||||||||||||||||||||||||||||
Db  866 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898
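
The statistic lines printed above each alignment ("Query Match ...; Score ...; DB ...; Length ...;" and the two lines after it) are semicolon-separated name/value pairs. A minimal parsing sketch follows, assuming every value is numeric with an optional trailing percent sign; `parse_stat_line` is a hypothetical helper, not part of GenCore.

```python
from typing import Dict


def parse_stat_line(line: str) -> Dict[str, float]:
    """Split 'Name value; Name value;' into a {name: float} mapping.

    Pieces without a value (for example a field truncated by extraction)
    are skipped rather than guessed.
    """
    fields: Dict[str, float] = {}
    for piece in line.split(";"):
        piece = piece.strip()
        if not piece:
            continue
        # The value is the last whitespace-separated token of the piece.
        name, _, value = piece.rpartition(" ")
        if not name:
            continue
        fields[name.strip()] = float(value.rstrip("%"))
    return fields


stats = {}
for text in (
    "Query Match 100.0%; Score 4660; DB 15; Length 898;",
    "Best Local Similarity 100.0%; Pred. No. 0;",
    "Matches 873; Conservative 0; Mismatches 0; Indels 0; Gaps 0;",
):
    stats.update(parse_stat_line(text))
print(stats)
```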



RESULT 3 
US-09-970-944-2 

; Sequence 2, Application US/09970944 
; Publication No. US20030204052A1 
; GENERAL INFORMATION: 

APPLICANT: Herrman, John L 
; APPLICANT: Rastelli, Luca 
; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same and
; TITLE OF INVENTION: Antibodies Directed Against these Proteins
; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970,944 
; CURRENT FILING DATE: 2002-05-02 
; PRIOR APPLICATION NUMBER: 60/237,862 
; PRIOR FILING DATE: 2000-10-04 
; NUMBER OF SEQ ID NOS: 62
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 2
; LENGTH: 899
; TYPE: PRT
; ORGANISM: Homo sapiens 
US-09-970-944-2 

Query Match 98.0%; Score 4567.5; DB 10; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 863; Conservative 2; Mismatches 7; Indels 3; Gaps 3;

Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||||||| ||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSD 299
        |||||||||||||||||||||  :|  : |  ||||||||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDGSWSPWSKWSACGLDCTHWRSRECSD 325

Qy  300 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 359
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  326 PAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYC 385

Qy  360 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGP 419
        ||||||||||||||||||||||||||||||||||||||||||| ||||||||||||||||
Db  386 RKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLS-TTTTYQGSLCPRQDGP 444

Qy  420 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 479
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  445 SPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFN 504

Qy  480 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 539
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  505 FLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPP 564

Qy  540 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWE-DVLHLGEEAPSHLYYCQLEASA 598
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db  565 GVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEQDVLHLGEEAPSHLYYCQLEASA 624

Qy  599 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 658
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  625 CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQ 684

Qy  659 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 718
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  685 LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 744

Qy  719 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 778
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  745 LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 804

Qy  779 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 838
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  805 VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 864

Qy  839 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||||||||||||||||||||||||||||||
Db  865 EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899
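
The "Best Local Similarity" figure appears to be the number of identical positions divided by the number of aligned columns (matches + conservative + mismatches + indels). This is inferred from the counts printed in this report, for example 863 / 875 for the result above, and is not a documented GenCore definition; `best_local_similarity` is a hypothetical name.

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Percent identity over all aligned columns, rounded to 0.1.

    Assumption: every alignment column falls into exactly one of the
    four counted categories.
    """
    columns = matches + conservative + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return round(100.0 * matches / columns, 1)


# Counts as printed for three results in this report.
print(best_local_similarity(873, 0, 0, 0))    # identical hit
print(best_local_similarity(863, 2, 7, 3))    # hit with three single-residue gaps
print(best_local_similarity(838, 17, 18, 0))  # ungapped hit with substitutions
```

Note that this differs from Query Match, which is score-based and therefore also reflects the substitution matrix and gap penalties.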



RESULT 4 
US-09-933-261-5 

; Sequence 5, Application US/09933261 
; Publication No. US20030040046A1 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 



TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 

CITY: SAN FRANCISCO 

STATE: CALIFORNIA 
; COUNTRY: USA 

; ZIP: 94104 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/09/933,261 

FILING DATE: 20-Aug-2001 
; CLASSIFICATION: <Unknown> 

; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/808,982 
FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 898 amino acids 
; TYPE: amino acid 

; STRANDEDNESS: Not Relevant
; TOPOLOGY: Not Relevant
; MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-933-261-5 

Query Match 96.8%; Score 4511; DB 10; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 838; Conservative 17; Mismatches 18; Indels 0; Gaps 0; 



Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDSSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 YLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSTSAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSSWSKWSACGLDCTHWRSRECSDP 325

Qy  301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360
        |||||||||:| ||||||||||||:|:|| |||||||:||:|||||| |||| | |:|||
Db  326 APRNGGEECRGADLDTRNCTSDLCLHTASCPEDVALYIGLVAVAVCLFLLLLALGLIYCR 385

Qy  361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||
Db  386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCSRQDGPS 445

Qy  421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480
        |||||:||||||||| |||||||||||||||:||||||||||||||||||||| ||||||
Db  446 PKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNF 505

Qy  481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||||
Db  506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPVVSCGPPG 565

Qy  541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600
        ||||||||||||||||||||||||||||||||||||||||||||:||||||||||| |||
Db  566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGACY 625

Qy  601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 660
        |||||||||||||||||||| |||:|||||||||||||||||||||||||||||||||||
Db  626 VFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 685

Qy  661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||
Db  686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQQYLH 745

Qy  721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780
        |||||||:: ||||||||:||||||||||||:|||||||||||||||||||| |||||||
Db  746 CTFTLERINASTSDLACKVWVWQVEGDGQSFNINFNITKDTRFAELLALESEGGVPALVG 805

Qy  781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840
        |||||||||||||||:|||||| |||||||||||||||||||||||||||||||||||||
Db  806 PSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy  841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||| ||||||||||||||||||||||||
Db  866 RHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898



RESULT 5 

US-09-970-944-13 

; Sequence 13, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; APPLICANT: Herrman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same and

; TITLE OF INVENTION: Antibodies Directed Against these Proteins



; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970,944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 

; PRIOR FILING DATE: 2000-10-04 

; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 13 

LENGTH: 898 

TYPE: PRT 
; ORGANISM: Rattus norvegicus 
US-09-970-944-13 

Query Match 96.8%; Score 4511; DB 10; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 838; Conservative 17; Mismatches 18; Indels 0; Gaps 0; 



Qy    1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy   61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120
        ||||||||| ||||||||||||||||||||||||||||||||||||||||||||||||||
Db   86 DHVIERSTDSSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy  121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  146 YLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy  181 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240
        ||||||||||||||||||||||||||| ||||||||||||||||||||||||||||||||
Db  206 LVVRQARLADTANYTCVAKNIVARRRSTSAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy  241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300
        ||||||||||||||||||||||||||||||||||||| ||||||||||||||||||||||
Db  266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSSWSKWSACGLDCTHWRSRECSDP 325

Qy  301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360
        |||||||||:| ||||||||||||:|:|| |||||||:||:|||||| |||| | |:|||
Db  326 APRNGGEECRGADLDTRNCTSDLCLHTASCPEDVALYIGLVAVAVCLFLLLLALGLIYCR 385

Qy  361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||| ||||||
Db  386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCSRQDGPS 445

Qy  421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480
        |||||:||||||||| |||||||||||||||:||||||||||||||||||||| ||||||
Db  446 PKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNF 505

Qy  481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||:|||||||
Db  506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPVVSCGPPG 565

Qy  541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600
        ||||||||||||||||||||||||||||||||||||||||||||:||||||||||| |||
Db  566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGACY 625

Qy  601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 660
        |||||||||||||||||||| |||:|||||||||||||||||||||||||||||||||||
Db  626 VFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 685

Qy  661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||:|||
Db  686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQQYLH 745

Qy  721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780
        |||||||:: ||||||||:||||||||||||:|||||||||||||||||||| |||||||
Db  746 CTFTLERINASTSDLACKVWVWQVEGDGQSFNINFNITKDTRFAELLALESEGGVPALVG 805

Qy  781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840
        |||||||||||||||:|||||| |||||||||||||||||||||||||||||||||||||
Db  806 PSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy  841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873
        |||||||| ||||||||||||||||||||||||
Db  866 RHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898



RESULT 6 
US-10-256-702-5 

; Sequence 5, Application US/10256702 
; Publication No. US20030059859A1 
; GENERAL INFORMATION: 

; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
; NUMBER OF SEQUENCES: 8 

; CORRESPONDENCE ADDRESS: 

; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 
; CITY: SAN FRANCISCO 

; STATE: CALIFORNIA 

COUNTRY: USA 

ZIP: 94104 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/256,702 

FILING DATE: 27-Sep-2002 

CLASSIFICATION: <Unknown> 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261

FILING DATE: 20-Aug-2001 
; APPLICATION NUMBER: 08/808,982 

FILING DATE: <Unknown> 
; ATTORNEY/AGENT INFORMATION: 

; NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 



Qy 

Db 

Qy 

Db 

Qy 

Db 
Qy 

Db 

Qy 

Db 



REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 898 amino acids 

; TYPE: amino acid 

STRANDEDNESS: No. US20030059859A1 Relevant 
TOPOLOGY: No. US20030059859A1 Relevant 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-10-256-702-5 

Query Match 96.8%; Score 4511; DB 14; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 838; Conservative 17; Mismatches 18; Indels 0; Gaps 0; 

Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 26 QQSATVANPVPGANPDLLPHFLWPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85 

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 86 DHVIERSTDSSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145 

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 146 YLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205 

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 206 LWRQARLADTANYTCVAKNIVARRRSTSAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSSWSKWSACGLDCTHWRSRECSDP 325 

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360 

111111111:1 I I I I I I I I I I I I : I : I I I I I I I I I : I I : I I I I I I I I II I I : I I I 
Db 326 APRNGGEECRGADLDTRNCTSDLCLHTASCPEDVALYIGLVAVAVCLFLLLLALGLIYCR 385 

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420 

I II I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCSRQDGPS 445 

Qy 421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480 

I I I II : I I I I I I I I I I I I I I I I I I II I I I I : I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 446 PKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNF 505 

Qy 481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 
Db 506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPWSCGPPG 565 



Qy 541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600

I I I I I I I I I I I I I I I I I I I I I I I U I I I I I I I I I I I I I I I I I II : I I I I I I I I I I I III
Db 566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGACY 625



Qy 601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 660 

I I I I I I I I I I I I I II I I I I I I I I : I I II II I I I II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 626 VFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 685 

Qy 661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I : I I I 
Db 686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQQYLH 745 

Qy 721 CTFTLERVSPSTSDLACKLWWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780 

I I I I I I I : : I I I I I I I I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 746 CTFTLERINASTSDLACKVWVWQVEGDGQSFNINFNITKDTRFAELLALESEGGVPALVG 805

Qy 781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840 

I I I I I I I I I I I II I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 806 PSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865 

Qy 841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 866 RHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898



RESULT 7 

US-10-240-154-16 

; Sequence 16, Application US/10240154
; Publication No. US20030175741A1
; GENERAL INFORMATION:
; APPLICANT: Cochran et al.
; TITLE OF INVENTION: SCHIZOPHRENIA RELATED GENES
; FILE REFERENCE: CKFW-P01-006
; CURRENT APPLICATION NUMBER: US/10/240,154
; CURRENT FILING DATE: 2001-04-02
; PRIOR APPLICATION NUMBER: PCT/GB01/01486
; PRIOR FILING DATE: 2001-04-02
; NUMBER OF SEQ ID NOS: 34
; SOFTWARE: PatentIn version 3.2
; SEQ ID NO 16
; LENGTH: 898
; TYPE: PRT
; ORGANISM: Rattus sp.
US-10-240-154-16 

Query Match 96.8%; Score 4511; DB 14; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 838; Conservative 17; Mismatches 18; Indels 0; Gaps 0;

Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 86 DHVIERSTDSSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 146 YLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205 



Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240 

I I I I I I I I I I I I I I I I I I I I I 1 II I I I I I I I I I II I I I I I I I i I I I I I I I I I I I I I I II 
Db 206 LWRQARLADTANYTCVAKNIVARRRSTSAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 265

Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSSWSKWSACGLDCTHWRSRECSDP 325

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360 

111111111:1 I I I I I I I I I I I I : I : I I I I I I I I I : I I : I I I I I I I I I I I hill 
Db 326 APRNGGEECRGADLDTRNCTSDLCLHTASCPEDVALYIGLVAVAVCLFLLLLALGLIYCR 385

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420 

I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCSRQDGPS 445 

Qy 421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480 

I I I I I : I I I I I I I I I I I I I I I I II I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 446 PKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNF 505 

Qy 481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 
Db 506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPWSCGPPG 565

Qy 541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I : I I I I I I I I I II I I I 
Db 566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGACY 625 

Qy 601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 660 

I I I I I I I I II I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 626 VFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 685 

Qy 661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I 
Db 686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQQYLH 745 

Qy 721 CTFTLERVSPSTSDLACKLWWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780 

I I I I I I I :: II I I I I I I : I I I I II I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 746 CTFTLERINASTSDLACKVWVWQVEGDGQSFNINFNITKDTRFAELLALESEGGVPALVG 805 

Qy 781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840 

I I I I I I I I II I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 806 PSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy 841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

II I I I I I I I I I I I I II I I I I II I I I I I I I I I I 

Db 866 RHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 



RESULT 8 
US-10-311-623-1 

; Sequence 1, Application US/10311623
; Publication No. US20040023244A1
; GENERAL INFORMATION:
; APPLICANT: INCYTE GENOMICS, INC.; GRIFFIN, Jennifer A.
; APPLICANT: KALLICK, Deborah A.; TRIBOULEY, Catherine M.
; APPLICANT: YUE, Henry; NGUYEN, Danniel B.
; APPLICANT: TANG, Y. Tom; LAL, Preeti G.
; APPLICANT: POLICKY, Jennifer L.; AZIMZAI, Yalda
; APPLICANT: LU, Dyung Aina M.; GRAUL, Richard C.
; APPLICANT: YAO, Monique G.; BURFORD, Neil
; APPLICANT: HAFALIA, April J. A.; BAUGHN, Mariah R.
; APPLICANT: BANDMAN, Olga; ARVIZU, Chandra S.
; APPLICANT: YANG, Junming; XU, Yuming
; APPLICANT: GANDHI, Ameena R.; WARREN, Bridget A.
; APPLICANT: DING, Li; SANJANWALA, Madhusudan M.
; APPLICANT: DUGGAN, Brendan M.; LU, Yan
; TITLE OF INVENTION: RECEPTORS
; FILE REFERENCE: PF-0793 USN
; CURRENT APPLICATION NUMBER: US/10/311,623
; CURRENT FILING DATE: 2002-12-17
; PRIOR APPLICATION NUMBER: US 01/19942
; PRIOR FILING DATE: 2001-06-21
; PRIOR APPLICATION NUMBER: US 60/214,027
; PRIOR FILING DATE: 2000-06-21
; PRIOR APPLICATION NUMBER: US 60/228,045
; PRIOR FILING DATE: 2000-08-25
; PRIOR APPLICATION NUMBER: US 60/255,104
; PRIOR FILING DATE: 2000-12-12
; NUMBER OF SEQ ID NOS: 24
; SOFTWARE: PERL Program
; SEQ ID NO 1
; LENGTH: 842
; TYPE: PRT
; ORGANISM: Homo sapiens
; FEATURE:
; NAME/KEY: misc_feature
; OTHER INFORMATION: Incyte ID No. 6052371CD1
US-10-311-623-1 

Query Match 91.9%; Score 4282; DB 15; Length 842; 

Best Local Similarity 93.4%; Pred. No. 0; 

Matches 815; Conservative 1; Mismatches 1; Indels 56; Gaps 1; 



Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 86 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy 121 RLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 180

 |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 146 YLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHS 205

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKR 240

|||||||||||||||||||||||||||||||||||
Db 206 LWRQARLADTANYTCVAKNIVARRRSASAAVIVY------------------------- 240

Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300

                               |||||||||||||||||||||||||||||
Db 241 -------------------------------VDGSWSPWSKWSACGLDCTHWRSRECSDP 269



Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360 

I I I I i I I I I I I I I I I I I I I I I I I I II : I I I I I I I I I I I I I 1 I I I I II I I I I I I I I I I I I I 
Db 270 APRNGGEECQGTDLDTRNCTSDLCVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 329 

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 330 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 389 

Qy 421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 390 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 449 

Qy 481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 450 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 509 

Qy 541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 510 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 569 

Qy 601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 570 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLE 629 

Qy 661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 630 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 689 

Qy 721 CTFTLERVSPSTSDLACKLWWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 690 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 749

Qy 781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 750 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 809

Qy 841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 810 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 842 



RESULT 9 

US-09-970-944-14 

; Sequence 14, Application US/09970944
; Publication No. US20030204052A1
; GENERAL INFORMATION:
; APPLICANT: Herrman, John L
; APPLICANT: Rastelli, Luca
; APPLICANT: Shimkets, Richard A
; TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same and
; TITLE OF INVENTION: Antibodies Directed Against these Proteins
; FILE REFERENCE: 21402-138
; CURRENT APPLICATION NUMBER: US/09/970,944
; CURRENT FILING DATE: 2002-05-02
; PRIOR APPLICATION NUMBER: 60/237,862
; PRIOR FILING DATE: 2000-10-04
; NUMBER OF SEQ ID NOS: 62
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 14
; LENGTH: 544
; TYPE: PRT
; ORGANISM: Homo sapiens
US-09-970-944-14 

Query Match 61.1%; Score 2845; DB 10; Length 544; 

Best Local Similarity 100.0%; Pred. No. 2.7e-227; 

Matches 541; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 333 DVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNP 392 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 DVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNP 63

Qy 393 HLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEE 452 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 64 HLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEE 123 

Qy 453 FVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLT 512 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 124 FVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLT 183 

Qy 513 LHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCE 572

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 184 LHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCE 243

Qy 573 GSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPV 632 

I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 244 GSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPV 303 

Qy 633 ACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPS 692 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 304 ACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPS 363 

Qy 693 SLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFS 752 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 364 SLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFS 423

Qy 753 INFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLA 812

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 424 INFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLA 483

Qy 813 QKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I 
Db 484 QKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 543 

Qy 873 C 873 

I 

Db 544 C 544 



RESULT 10 
US-09-933-261-6 

; Sequence 6, Application US/09933261
; Publication No. US20030040046A1
; GENERAL INFORMATION:
; APPLICANT: Tessier-Lavigne, Marc
; Leonardo, E. David
; Hink, Lindsay
; Masu, Masayuki
; Kazuko, Keino-Masu
; TITLE OF INVENTION: Netrin Receptors
; NUMBER OF SEQUENCES: 8
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP
; STREET: 268 BUSH STREET, SUITE 3200
; CITY: SAN FRANCISCO
; STATE: CALIFORNIA
; COUNTRY: USA
; ZIP: 94104
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/09/933,261
; FILING DATE: 20-Aug-2001
; CLASSIFICATION: <Unknown>
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: 08/808,982
; FILING DATE: <Unknown>
; ATTORNEY/AGENT INFORMATION:
; NAME: OSMAN, RICHARD A
; REGISTRATION NUMBER: 36,627
; REFERENCE/DOCKET NUMBER: UC96-217
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: (415) 343-4341
; TELEFAX: (415) 343-4342
; INFORMATION FOR SEQ ID NO: 6:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 557 amino acids
; TYPE: amino acid
; STRANDEDNESS: Not Relevant
; TOPOLOGY: Not Relevant
; MOLECULE TYPE: peptide
; SEQUENCE DESCRIPTION: SEQ ID NO: 6:
US-09-933-261-6 



Query Match 60.4%; Score 2815.5; DB 10; Length 557; 

Best Local Similarity 96.8%; Pred. No. 7.9e-225; 

Matches 539; Conservative 2; Mismatches 15; Indels 1; Gaps 1; 

Qy 318 NCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 377 

I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I 
Db 1 NCTSDLXVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 60 



Qy 378 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 437 

I I I I i I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I 
Db 61 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 120 



Qy 438 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 497

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! 11 I I I I I I I I I I I I I
Db 121 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 180



Qy 498 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 557 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 240

Qy 558 SPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 617 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 SPDSWSLALKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 300

Qy 618 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFK 677 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHLX 360 

Qy 678 DSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 737 

I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 DSYHNLXLSXHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 420 

Qy 738 KLWWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 797 

I II I I I I I I I I I I I I II I I I I I I II I II I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 421 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 480

Qy 798 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 857 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I, 
Db 481 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 540

Qy 858 LGQPDAGLFT-VSEAEC 873 

I : I I I I I 
Db 541 TXPAGRWLLSQCSEAEC 557 



RESULT 11 
US-10-256-702-6 

; Sequence 6, Application US/10256702
; Publication No. US20030059859A1
; GENERAL INFORMATION:
; APPLICANT: Tessier-Lavigne, Marc
; Leonardo, E. David
; Hink, Lindsay
; Masu, Masayuki
; Kazuko, Keino-Masu
; TITLE OF INVENTION: Netrin Receptors
; NUMBER OF SEQUENCES: 8
; CORRESPONDENCE ADDRESS:
; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP
; STREET: 268 BUSH STREET, SUITE 3200
; CITY: SAN FRANCISCO
; STATE: CALIFORNIA
; COUNTRY: USA
; ZIP: 94104
; COMPUTER READABLE FORM:
; MEDIUM TYPE: Floppy disk
; COMPUTER: IBM PC compatible
; OPERATING SYSTEM: PC-DOS/MS-DOS
; SOFTWARE: PatentIn Release #1.0, Version #1.30
; CURRENT APPLICATION DATA:
; APPLICATION NUMBER: US/10/256,702
; FILING DATE: 27-Sep-2002
; CLASSIFICATION: <Unknown>
; PRIOR APPLICATION DATA:
; APPLICATION NUMBER: US/09/933,261
; FILING DATE: 20-Aug-2001
; APPLICATION NUMBER: 08/808,982
; FILING DATE: <Unknown>
; ATTORNEY/AGENT INFORMATION:
; NAME: OSMAN, RICHARD A
; REGISTRATION NUMBER: 36,627
; REFERENCE/DOCKET NUMBER: UC96-217
; TELECOMMUNICATION INFORMATION:
; TELEPHONE: (415) 343-4341
; TELEFAX: (415) 343-4342
; INFORMATION FOR SEQ ID NO: 6:
; SEQUENCE CHARACTERISTICS:
; LENGTH: 557 amino acids
; TYPE: amino acid
; STRANDEDNESS: Not Relevant
; TOPOLOGY: Not Relevant
; MOLECULE TYPE: peptide
; SEQUENCE DESCRIPTION: SEQ ID NO: 6:
US-10-256-702-6 

Query Match 60.4%; Score 2815.5; DB 14; Length 557; 

Best Local Similarity 96.8%; Pred. No. 7.9e-225; 

Matches 539; Conservative 2; Mismatches 15; Indels 1; Gaps 1; 

Qy 318 NCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 377 

I I I I II I I : I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 NCTSDLXVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 60

Qy 378 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 437 

I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 120 

Qy 438 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 497

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 121 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 180

Qy 498 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 557 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I 
Db 181 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 240 

Qy 558 SPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 617 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 241 SPDSWSLALKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 300 

Qy 618 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFK 677 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 301 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHLX 360 



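Qy 678 DSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 737

|||||| || ||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 DSYHNLXLSXHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 420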



Qy 738 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 797 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 421 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 480

Qy 798 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 857 

I I I I I I I I I I I i I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 540

Qy 858 LGQPDAGLFT-VSEAEC 873 

I : II I I I 
Db 541 TXPAGRWLLSQCSEAEC 557 



RESULT 12 
US-09-970-944-15 

; Sequence 15, Application US/09970944
; Publication No. US20030204052A1
; GENERAL INFORMATION:
; APPLICANT: Herrman, John L
; APPLICANT: Rastelli, Luca
; APPLICANT: Shimkets, Richard A
; TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same and
; TITLE OF INVENTION: Antibodies Directed Against these Proteins
; FILE REFERENCE: 21402-138
; CURRENT APPLICATION NUMBER: US/09/970,944
; CURRENT FILING DATE: 2002-05-02
; PRIOR APPLICATION NUMBER: 60/237,862
; PRIOR FILING DATE: 2000-10-04
; NUMBER OF SEQ ID NOS: 62
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 15
; LENGTH: 931
; TYPE: PRT
; ORGANISM: Caenorhabditis elegans
US-09-970-944-15 

Query Match 59.6%; Score 2778.5; DB 10; Length 931; 

Best Local Similarity 58.2%; Pred. No. 2e-221; 
Matches 513; Conservative 150; Mismatches 195; Indels 23; Gaps 7;

Qy 11 PGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDG 70

I I : I I I I I : I I I : I I II I I I I I I II I I I I I : I I I I III I III:: I
Db 54 PSDPPEPLPHFLIEPEEAYIVTOKPWLYCKASPATQIYFKCNSEWV^QKDHWDERVT)^ 113

Qy 71 SSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130

: I I I II I : I I I I I I : : I I I : I I I I I I I I I I : I I I I I : I I I : I I I III I I I I I
Db 114 TSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRIAYLRKTFEQEP 173

Qy 131 LAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHSLWRQARLAD 190

I I I I I I I I : : I I I I I I II I I I I I I I : I I I : : I I : I I III : I : I : : : I I I I : I
Db 174 LGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPAEDRNFYITIDHNLIIKQARLSD 233

Qy 191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250

II I I I I I I I I I I I : I : I : I I I I I I I I I I I I I I I I I I I : : I I I I : I I I : I : I I I I I I I
Db 234 TANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRCGRGYQKRTRTCTNPAPL 293



Qy 251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

111111111:111 II I I I I I I I I : I I I I I II : I I I I I III: I I I : I I I : : I 
Db 294 NGGAFCEGQSVQKIACTTLCPVDGRWTSWSKWSTCGTECTHWRRRECTAPAPKNGGKDCD 353 

Qy 311 GTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDV 369 

I I : : I I I I I : : I : I I I I I I I : I I I I I I : : : I : I I : : I I : 

Db 354 GLVLQSKNCTDGLCMQAAPDSDDVALWGIVIAVTVCLAITVVVALEVYRKNHRDFESDI 413 

Qy 370 ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGH 429 

I II I I It I I : I I : : I I I : I I I :: I : I : I I I : I I 

Db 414 IDS SALNGGFQPVNI KAARQD LLAVPPDLTSAAAMYRGPVYALHD-VSDKI PMTNSP 469 

Qy 430 LLSPLGGGRHTLHHSS PTSEAEEFVSRLS TQNYF RSLPRGT — S 471 

: I I I : : : : I I I : I I I : I I I I : : I I I I I 

Db 470 ILDPLPNLKIKVYNSSGAWPQDDLAEFSSKLSPQMTQSLLENEALNLKNQSLARQTDPS 529 

Qy 472 NMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLS 531

: I I I I III I : I I I : I : I I I I I I I I : I : : I I : I : I : I : I :: I I : MM:
Db 530 CTAFGTFNSLGGHLIIPNSGVSLLIPAGAIPQGRWEMYVTVHRKENMRPPMEDSQTLLT 589

Qy 532 PIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYY 591 

I : I I I I I I I I I I I I I I I : I I : I I : I :: I I I :: I I I I I :: I I I : I 

Db 590 PWSCGPPGALLTRPVILTLHHCADPSTEDWKIQLKNQAVQGQWEDWWGEENFTTPCY 649 

Qy 592 CQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHD 651 

11:1 Ih: II I :||||:: : MINIM :| |: I : I I 11 : I II I I i II I 
Db 650 IQLDAEACHILTENLSTYALVGQSTTKAAAKRLKLAI FGPLCCSSLEYSIRVYCLDDTQD 709 

Qy 652 ALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHI 711 

II M I : II I : I : II I I : : II : I I I I I I II I II I II : I I I I I II I II I II II II 

Db 710 ALKEVLQLERQMGGQLLEEPKALHFKGSIHNLRLSIHDIAHSLWKSKLLAKYQEIPFYHI 769 

Qy 712 WNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALES 771 

I : I : I I I I I I II I I I : I M : I M I I M I I : I I I : I : : : : : I I : 
Db 770 WSGSQRNLHCTFTLERLSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDP 829 

Qy 772 EAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPT 831

: : : II I I I I I II I I : I II I I I I I M I I I I : I I : I : : I I : I II I 
Db 830 ASTITTVTGPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPT 889 

Qy 832 AMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872

:||:IMI::||:MM Ml : :|: : : :| : 
Db 890 GVILDLWEAQNFPDGNLSMLAAVLEEMGRHETWSLAAEGQ 930 



RESULT 13 
US-09-972-211-121 

Sequence 121, Application US/09972211 
Publication No. US20040048245A1 
GENERAL INFORMATION: 
APPLICANT: Shimkets, Richard A 
APPLICANT: Taupier Jr, Raymond J 
APPLICANT: Burgess, Catherine E 
APPLICANT: Zerhusen, Bryan D 
APPLICANT: Mezes, Peter S 
APPLICANT: Rastelli, Luca 



A DDT Tr'TVMT 1 • 

ArrIj.LL.ANl : 


Malyankar, Uriel M 


7\ r\ r\ T T f 71 XTfTl - 

APPLICANT : 


Grosse, 


William M 


TV nDT TP7\ XirTi • 

ArrJjluAJM 1 : 


Alsobroo 


K 11, Jonn r 


APPLICANT : 


Lepley, 


Denise M 


APPLICANT : 


Spytek, 


Kimberly Ann 


TV nnT TPTl "NTT* . 

ArrLICANl : 


Li, Li 




APPLICANT: 


Edinger, 


Shlomit 


tv nnT t /*• 7i xTrn . 

APPLICANT : 


Gerlach, 


Valerie 


APPLICANT : 


Ellerman 


, Karen 


APPLICANT: 


MacDougall, John R 


APPLICANT : 


Gunther, 


Erik 


APPLICANT: 


Millet, 


Isabelle 


APPLICANT: 


Stone, David J 


APPLICANT: 


Smithson 


, Glennda 


APPLICANT: 


Szekeres 


Jr, Edward S 



; TITLE OF INVENTION: No. US20040048245Alel Human Proteins, Polynucleotides 
Encoding Them And 

; TITLE OF INVENTION: Methods Of Using The Same 

; FILE REFERENCE: 21402-141 

; CURRENT APPLICATION NUMBER: US/09/972,211 

; CURRENT FILING DATE: 2001-10-05 

; PRIOR APPLICATION NUMBER: 60/238,325 

; PRIOR FILING DATE: 2000-10-05 

; PRIOR APPLICATION NUMBER: 60/238,323 

; PRIOR FILING DATE: 2000-10-05 

; PRIOR APPLICATION NUMBER: 60/238,400 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,397 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,401 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,379 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,402 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,384

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,373 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,372 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,383 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/238,382 

; PRIOR FILING DATE: 2000-10-06 

; PRIOR APPLICATION NUMBER: 60/275,892 

; PRIOR FILING DATE: 2001-03-14 

; PRIOR APPLICATION NUMBER: 60/296,860 

; PRIOR FILING DATE: 2001-06-08 

; NUMBER OF SEQ ID NOS : 198 

; SOFTWARE: PatentIn Ver. 2.1

; SEQ ID NO 121 

LENGTH: 931 

TYPE : PRT 

; ORGANISM: Mus musculus 
US-09-972-211-121 



Query Match 59.6%; Score 2778.5; DB 11; Length 931; 

Best Local Similarity 58.2%; Pred. No. 2e-221; 

Matches 513; Conservative 150; Mismatches 195; Indels 23; Gaps 7; 

Qy 11 PGANPDLLPHFLVEPEDVTIV7CNKP VXLVCKAWATQI FFKCNGEWVRQVDHVT ERSTDG 70 

I I : I I I I I : I I I : I I I I I I I I I I I I I I I I I : I I I I I I I I I I I : : I 
Db 54 PSDPPEPLPHFLIEPEEAYIVKNKPWLYCKASPATQIYFKCNSEWVlIQKDHvVDERVDE 113 

Qy 71 SSGLPTMEVT^I^SRQQV^KVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

: I I I II I : I I I I I I : : I I I : I I I I I I I I I I : I I I I I : I I I : I I I III I I I I I 
Db 114 TSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRIAYLRKTFEQEP 173 

Qy 131 LAKEVSLEQGIvXPCRPPEGIPPAEVTCWLRNEDLW 190 

I I I I I I I I : : I I I I I I I I I I I I I I I : I I I : : I I : I I III : I : I : : : I I I I : I 
Db 174 LGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPAEDRNFYITIDHNLIIKQARLSD 233 

Qy 191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250

I I I I I I I I I I I I I : I : I : I I I I I I I I I I I I I I I I I I I : : I I I I : I I I : I : I I I I I I I 
Db 234 TANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRCGRGYQKRTRTCTNPAPL 293 

Qy 251 NGGAFCEGQNVQKTACATLCPVT)GSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

111111111:111 II I I I I I I I I : Mill II : I I I I I III: I I I : I I I : : I 
Db 294 NGGAFCEGQSVQKIACTTLCPVDGRWTSWSKWSTCGTECTHWRRRECTAPAPKNGGKDCD 353 

Qy 311 GTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDV 369 

I I :: I I I II: : I : I I I I I I I : I I I I I I : : : I : I I : : I I : 

Db 354 GLVTiQSKNCTDGLCMQAAPDSDDVALWGIVIAWVCLM 413 

Qy 370 ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGH 429 

I I I I I I I I I : I I : : I I I ■ : I I I : : I : I : I II : I I 
Db 414 I DS SALNGGFQPVNI KAARQD LLAVPPDLTSAAAMYRGPVYALHD-VSDKIPMTNSP 469 

Qy 430 LLS PLGGGRHTLHHS S PTSEAEEFVSRLS TQNYF RSLPRGT — S 471 

: I II : : : : I I I : I I I : I I I I : : I I I I I 

Db 470 ILDPLPNLKIKVTNSSGAVTPQDDLAEFSSKLSPQMTQSLLENEALNLKNQSLARQTDPS 529 

Qy 472 NMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLS 531

: I I I I III I : II I : I : I I I I I I I I : I : : I I : I : I : I : I : : I I : INI: 
Db 530 CTAFGTFNSLGGHLIIPNSGVSLLIPAGAIPQGRVYEMYVTVHRKENMRPPMEDSQTLLT 589 

Qy 532 PIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYY 591

I : I I I I II I II I I I I II : I I : I I : I :: I I I :: I I I I I :: I II : I 
Db 590 P WS CGP PGALLT RPVI LTLHHCAD P S TEDWKI QLKNQAVQGQWEDWWGEEN FTT PC Y 649 

Qy 592 CQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHD 651 

I I : I II:: II I : I I I I : : : I I II I I I I : I I : I : I I I I : I I I I I I II I 
Db 650 IQLDAEACHILTENLSTYALVGQSTTKAAAKRLKLAIFGPLCCSSLEYSIRVYCLDDTQD 709

Qy 652 ALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHI 711 

I I I I I : I I I : I : I I I I : : I I : I I I I I MINIMI: I I I 1 I I 1 I I I I I II II I 
Db 710 ALKEVLQLERQMGGQLLEEPKALHFKGSIHNLRLSIHDIAHSLWKSKLLAKYQEIPFYHI 769 

Qy 712 WNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALES 771

I : I : I I II I M II II : I : I : I III I ill!: II | : I :::: : I I :. 
Db 770 WSGSQRNLHCTFTLERLSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDP 829 



Qy 772 EAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPT 831

: : : I I I I I I I I I I I : I I I I I I I I I I 1 I I I : II : I : : I I : I I I I 
Db 830 ASTITTVTGPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPT 889

Qy 832 AMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872

:||:||||::||:IMI III : :|: : : :| : 
Db 890 GVILDLWEAQNFPDGNLSMLAAVLEEMGRHETVVSLAAEGQ 930
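
The statistics printed for this result (Matches 513; Conservative 150; Mismatches 195; Indels 23) are consistent with "Best Local Similarity" being the share of identical positions among all aligned columns. That relationship is inferred from the numbers in this report, not taken from GenCore documentation, and the function name below is purely illustrative. A minimal sketch under that assumption:

```python
def best_local_similarity(matches: int, conservative: int,
                          mismatches: int, indels: int) -> float:
    """Percent of alignment columns that are identical residues.

    Assumption: the denominator is the total number of aligned columns,
    i.e. matches + conservative + mismatches + indels.
    """
    columns = matches + conservative + mismatches + indels
    if columns == 0:
        raise ValueError("alignment has no columns")
    return round(100.0 * matches / columns, 1)


# Figures reported for the 931-residue hits in this report.
print(best_local_similarity(513, 150, 195, 23))  # 58.2
```

The same formula reproduces the 96.7% reported for the UN5A_MOUSE hit (844 matches, 19 conservative, 10 mismatches, 0 indels).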



RESULT 14 
US-10-087-684-35 

Sequence 35, Application US/10087684 
Publication No. US20040029116A1 
GENERAL INFORMATION: 
APPLICANT: Edinger, Shlomit R. 
APPLICANT: MacDougall, John R. 
APPLICANT: Millet, Isabelle 
APPLICANT: Ellerman, Karen 
APPLICANT: Stone, David J. 
APPLICANT: Grosse, William M. 
APPLICANT: Lepley, Denise M. 
APPLICANT: Rieger, Daniel K. 
APPLICANT: Burgess, Cathereine E. 
APPLICANT: Casman, Stacie, J. 
APPLICANT: Spytek, Kimberly A. 
APPLICANT: Boldog, Ferenc L. 
APPLICANT: Li, Li 

APPLICANT: Padigaru, Muralidhara 
APPLICANT: Mishra, Vishnu 
APPLICANT: Shenoy, Suresh G. 
APPLICANT: Rastelli, Luca 
APPLICANT: Tchernev, Velizar T. 
APPLICANT: Vernet, Corine A.M. 
APPLICANT: Zerhusen, Bryan D. 
APPLICANT: Malyankar, Uriel M. 
APPLICANT: Guo, Xiaojia 
APPLICANT: Miller, Charles E. 
APPLICANT: Gangolli, Esha A. 

TITLE OF INVENTION: PROTEINS AND NUCLEIC ACIDS ENCODING SAME 
FILE REFERENCE: 21402-214 CIP 
CURRENT APPLICATION NUMBER: US/10/087,684 
CURRENT FILING DATE: 2003-03-10 
PRIOR APPLICATION NUMBER: 60/253,834 
PRIOR FILING DATE: 2000-11-29 
PRIOR APPLICATION NUMBER: 60/250,926 
PRIOR FILING DATE: 2000-11-30 
PRIOR APPLICATION NUMBER: 60/264,180
PRIOR FILING DATE: 2001-01-25 
PRIOR APPLICATION NUMBER: 60/274,194 
PRIOR FILING DATE: 2001-03-08 
PRIOR APPLICATION NUMBER: 60/313,656 
PRIOR FILING DATE: 2001-08-20 
PRIOR APPLICATION NUMBER: 60/327,456 
PRIOR FILING DATE: 2001-10-05 
NUMBER OF SEQ ID NOS : 220 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 35 
LENGTH: 931 



TYPE: PRT 
; ORGANISM: Mus musculus
US-10-087-684-35 



Query Match 59.6%; Score 2778.5; DB 15; Length 931; 

Best Local Similarity 58.2%; Pred. No. 2e-221; 

Matches 513; Conservative 150; Mismatches 195; Indels 23; Gaps 7; 

Qy 11 PGANPDLLPHFLVEPEDWIVTCNKPVXLVCKAVPATQI^ 70 

I I : I I I I I : I I I : I I I I I I I I I III I I II I : I I I I III I III:: I 
Db 54 PSDPPEPLPHFLIEPEEAYIVT^NKPWLYCKASPATQIYFKCNSEWVliQKDHWDERvTDE 113 

Qy 71 SSGLPTMEv^I^^^SRQQv^KVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

: I I I II I : I I I I I I : : II I : I I I I I I I I I I : I I I I I : I I I : I I I III I I I I I 
Db 114 TSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRIAYLRKTFEQEP 173

Qy 131 LAKEVSLEQGIVXPCRPPEGIPPAEVEWLRNEDLVDPSLDPNWITREHSLVVRQARI^ 190 

I I I I I I I I : : I I I I I I I I I I I I I I I : I I I : : I I : I I III : I : I : : : I I I I : I 
Db 174 LGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPAEDRNFYITIDHNLIIKQARLSD 233

Qy 191 TANYTCVAKNIVARRRSASAAVIWWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250 

I I I I I I I I I I I I I : I : I : I II I I I I I I I I I I I I I I II : : I I I I : I I I : I : I I I I I I I 
Db 234 TANYTCVAKNIVAKRKSTTATVI VTWGGWSTWTEWSVCNSRCGRGYQKRTRTCTNPAPL 293 

Qy 251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

I I I I I I I I I : I I I II I I I II I I I : I I I I I II : I I I II III: I I I : I I I :: I 
Db 294 NGGAFCEGQSVQKIACTTLCPVT)GRWTSWSKWSTCGTECTHWRRRECTAPAPKNGGKDCD 353 

Qy 311 GTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDV 369 

I I : : I I I I I : : I : I I I I I I I : I I I I I I : : : I : I I : : I I : 

Db 354 GLVTiQSKNCTDGLCMQAAPDSDDVALWGIVIAWVCLAITVWALFVYRKNHRDFESDI 413 

Qy 370 ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGH 429 

I I I I I I I I I : I I : : I I I : I I I : : I : I : I II. : I I 
Db 414 IDS S ALNGG FQ P VN I KAARQ D LLAVPPDLTSAAAMYRGPVYALHD-VSDKIPMTNSP 469 

Qy 430 LLSPLGGGRHTLHHSS PTSEAEEFVSRLS TQNYF RSLPRGT — S 471 

: I I I : : : : I I I : I I I : I I I I : : I I I I I 

Db 470 ILDPLPNLKIKWNSSGAVTPQDDLAEFSSKLSPQMTQSLLENEALNLKNQSLARQTDPS 529 

Qy 472 NMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLS 531

: I I I I III I : I I I : I : I I I I I I I I : I : : I I : I : I : I : I : : I I : MM: 
Db 530 CTAFGTFNSLGGHLI I PNSGVSLLI PAGAI PQGRWEMYVTVHRKENMRPPMEDSQTLLT 589 

Qy 532 PIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYY 591 

I : I I I I I I I. I I II I I I I : I I : I I : I : : I I I : : I I I I I : : I I I : I 
Db 590 PWSCGPPGALLTRPVILTLHHCADPSTEDWKIQLKNQAVQGQWEDWWGEENFTTPCY 649 

Qy 592 CQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHD 651 

I I : I II:: II I : I I I I : : : I I I I I I I I : I I : I : I I I I : I I I I I I II I 
Db 650 IQLDAEACHILTENLSTYALVGQSTTKAAAKRLKLAIFGPLCCSSLEYSIRVYCLDDTQD 709

Qy 652 ALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHI 711 

I I I I I : I I I : I : I I I I : : II : II II I MINIMI: I I I I I II I I! Mill! I 
Db 710 ALKEVLQLERQMGGQLLEEPKALHFKGSIHNLRLSIHDIAHSLWKSKLLAKYQEIPFYHI 769 

Qy 712 WNGTQRYLHCTFTLERVSPSTSDLACKLWWQVEGDGQSFSINFNITKDTRFAELLALES 771 



I : I : I I I I I I I I I I I : I : I : I I I I I I I I I : I I I : I : : : : : I I : 
Db 770 WSGSQRNLHCTFTLERLSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDP 829

Qy 772 EAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPT 831 

: : : Mill II MM: I I I I I I I M I I I M : I I : I : : I I : I I M 
Db 830 ASTITTVTGPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPT 889 

Qy 832 AMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872 

: II : II I I: : II : II II I II : : I : : : : I : 
Db 890 GVILDLWEAQNFPDGNLSMLAAVLEEMGRHETVVSLAAEGQ 930



RESULT 15 
US-10-037-417-117 

Sequence 117, Application US/10037417 
Publication No. US20040052806A1 
GENERAL INFORMATION: 
APPLICANT: Kekuda, Ramesh 
APPLICANT: Alsobrook II, John P 
APPLICANT: Tchernev, Velizar T 
APPLICANT: Liu, Xiaohong
APPLICANT: Spytek, Kimberly A 
APPLICANT: Patturajan, Meera 
APPLICANT: Grosse, William M 
APPLICANT: Lepley, Denise M 
APPLICANT: Burgess, Catherine E 
APPLICANT: Vernet, Corine A.M. 
APPLICANT: Li, Li 
APPLICANT: Gorman, Linda 
APPLICANT: Edinger, Shlomit R 
APPLICANT: Sciore, Paul 
APPLICANT: Ellerman, Karen 
APPLICANT: Malyankar, Uriel M 
APPLICANT: Rothenberg, Mark 
APPLICANT: Stone, David J 
APPLICANT: Boldog, Ferenc L 
APPLICANT: Guo, Xiaojia 
APPLICANT: Shenoy, Suresh G 
APPLICANT: Anderson, David W 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Taupier Jr, Raymond J 
APPLICANT: Miller, Charles E 
APPLICANT: Eisen, Andrew J 

TITLE OF INVENTION: Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-235 

CURRENT APPLICATION NUMBER: US/10/037,417 
CURRENT FILING DATE: 2002-09-20 
PRIOR APPLICATION NUMBER: 60/260,018 
PRIOR FILING DATE: 2001-01-05 
PRIOR APPLICATION NUMBER: 60/260,360 
PRIOR FILING DATE: 2001-01-08 
PRIOR APPLICATION NUMBER: 60/272,411 
PRIOR FILING DATE: 2001-02-28 
PRIOR APPLICATION NUMBER: 60/272,817 
PRIOR FILING DATE: 2001-03-02 
PRIOR APPLICATION NUMBER: 60/291,186 
PRIOR FILING DATE: 2001-05-15 



PRIOR APPLICATION NUMBER: 60/303,231 
PRIOR FILING DATE: 2001-07-05 
PRIOR APPLICATION NUMBER: 60/305,060 
PRIOR FILING DATE: 2001-07-12 
PRIOR APPLICATION NUMBER: 60/318,405 
PRIOR FILING DATE: 2001-09-10 
PRIOR APPLICATION NUMBER: 60/318,700 
PRIOR FILING DATE: 2001-09-12 
NUMBER OF SEQ ID NOS : 227 
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 117 
LENGTH: 931 
TYPE: PRT 

ORGANISM: Caenorhabditis elegans 
US-10-037-417-117 

Query Match 59.6%; Score 2778.5; DB 15; Length 931; 

Best Local Similarity 58.2%; Pred. No. 2e-221; 

Matches 513; Conservative 150; Mismatches 195; Indels 23; Gaps 7; 

Qy 11 PGANPDLLPHFLVTSPEDvTIVTWKPVXLVCKAV 70 

I I : I I I I I : I I I : I I I I I I I I I III I I I I I : I I I I III I III:: I 
Db 54 PSDPPEPLPHFLIEPEEAYIVl^NKPWLYCKASPATQIYFKCNSEWVTIQKDHVVDERVDE 113 

Qy 71 SSGLPTMEWINVSRQQV^KVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

: I I I II I : I II I I I :: I I I : I I I I I I I I I I : I I I I I : I I I : I I I III I I I I I 
Db 114 TSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRIAYLRKTFEQEP 173 

Qy 131 LAKEVSLEQGIVXPCRPPEGIPPAEVT£WLRNEDLV^ 190 

I I I I I I I I : : I I I I I I I II I I I I I I : I I I : : I I : I I III : I : I : : : I I I I : I 
Db 174 LGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPAEDRNFYITIDHNLIIKQARLSD 233 

Qy 191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250

I I I I I I I I I I II I : I : I : I I I I I I I I I I I I I I I I I I I : : I I I I : I I I : I : I I I I I I I 
Db 234 TANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRCGRGYQKRTRTCTNPAPL 293 

Qy 251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

I I I I I I I I I : I I I II I I I I I I I I : I I I I I II : I I I I I III: I I I : I I I :: I 
Db 294 NGGAFCEGQSVQKIACTTLCPVDGRWTSWSKWSTCGTECTHWRRRECTAPAPKNGGKDCD 353 

Qy 311 GTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDV 369 

I I : : I I I I I : : I : I I I I I I I : I I I I I I : : : I : I I : : I I : 

Db 354 GLVXQSKNCTDGLCMQAAPDSDDVALWGIVIAVTVCIAITVWALFVTR 413 

Qy 370 ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGH 429 

I I I I I I I I I : I I : : I I I : I I I : : I : I : I II : I I 
Db 414 I DS SALNGGFQPVNI KAARQD LLAVP PDLT SAAAMYRGPVYALHD- VS DKI PMTNS P 469 

Qy 430 LLSPLGGGRHTLHHSS PTSEAEEFVSRLS TQNYF RSLPRGT — S 471 

:| II : :::M I : II hi ||: Mill I 

Db 470 ILDPLPNLKIKVYNSSGAVTPQDDLAEFSSKLSPQMTQSLLENEALNLKNQSLARQTDPS 529 

Qy 472 NMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLS 531

Mill III hi I 1:1:1 I II I I I I : I : : I I : I : I : I : I. : : I |: I I I I : 
Db 530 CTAFGTFNSLGGHLI I PNSGVSLLI PAGAI PQGRVTEMYVTVHRKENMRPPMEDSQTLLT 589 



Qy 532 PIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYY 591



I : I I I I I I I I I I I I I I I : I I : I I : I :: I I I :: I I I I I :: I I I : I 
Db 590 PWSCGPPGALLTRPVILTLHHCADPSTEDWKIQLKNQAVQGQWEDWWGEENFTTPCY 649 

Qy 592 CQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHD 651 

I I : I I I : : I I I : I I I I : : : I I II I I ! I : I I : I : I I I I : I I I I I I I I I 
Db 650 IQLDAEACHILTENLSTYALVGQSTTKAAAKRLKLAIFGPLCCSSLEYSIRVYCLDDTQD 709 

Qy 652 ALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHI 711 

I I I I I : I I I : I : I I I I :: I I : I I I I I I I I I I I [ I I : I I I I I I I I I I I I I I I I I 
Db 710 ALKEVLQLERQMGGQLLEEPKALHFKGSIHNLRLSIHDIAHSLWKSKLLAKYQEIPFYHI 769 

Qy 712 WNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALES 771 

I : I : I I I I I I I I I I I : I : I : I I I I I I I I I : I I I : I : : : : : I I : 
Db 770 WSGSQRNLHCTFTLERLSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDP 829 

Qy 772 EAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPT 831 

: : : I I I I I I I I I I I : I I I I I I I I I I I I I I : I I : I : : I I : I I I I 
Db 830 ASTITTVTGPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPT 889

Qy 832 AMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872 

: I I : I I I I : : I I : I I I I I I I : : I : : : : I : 
Db 890 GVILDLWEAQNFPDGNLSMLAAVLEEMGRHETVVSLAAEGQ 930



Search completed: March 1, 2005, 09:51:29 
Job time : 172.022 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: March 1, 2005, 08:41:47 ; Search time 159.7 Seconds
(without alignments)
2799.282 Million cell updates/sec

Title: US-10-624-932-2_COPY_26_898
Perfect score: 4660
Sequence: 1 QQSATVANPVPGANPDLLPH AVAGLGQPDAGLFTVSEAEC 873

Scoring table: BLOSUM62
Gapop 10.0 , Gapext 0.5

Searched: 1612378 seqs, 512079187 residues



Total number of hits satisfying chosen parameters: 1612378 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : UniProt_03:* 

1 : uniprot_sprot : * 
2 : uniprot_trembl : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
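
The "% Query Match" column in the summaries that follow appears to be each hit's score expressed as a percentage of the perfect score (4660 for this query). This reading is inferred from the reported values rather than from GenCore documentation, and the function name is hypothetical. A minimal sketch under that assumption:

```python
def query_match_percent(score: float, perfect_score: float) -> float:
    """Hit score as a percentage of the perfect (self-match) score.

    Assumption: the report rounds the ratio to one decimal place.
    """
    if perfect_score <= 0:
        raise ValueError("perfect_score must be positive")
    return round(100.0 * score / perfect_score, 1)


PERFECT_SCORE = 4660  # "Perfect score" from the report header

print(query_match_percent(4558, PERFECT_SCORE))    # 97.8
print(query_match_percent(2778.5, PERFECT_SCORE))  # 59.6
```

Checking a row this way can help when a digit in the Score or Query Match column is unreadable.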

SUMMARIES 

                  %
Result          Query
   No.   Score  Match Length DB  ID           Description

    1     4558   97.8    898  1  UN5A_MOUSE   Q8k1s4 mus musculu
    2     4511   96.8    898  1  UN5A_RAT     O08721 rattus norv
    3     4279   91.8    842  1  UN5A_HUMAN   Q6zn44 homo sapien
    4   2778.5   59.6    931  1  UN5C_MOUSE   O08747 mus musculu
    5   2762.5   59.3    931  1  UN5C_RAT     Q761x5 rattus norv
    6   2750.5   59.0    931  1  UN5C_CHICK   Q7t2z5 gallus gall
    7   2747.5   59.0    931  1  UN5C_HUMAN   O95185 homo sapien
    8     2639   56.6    943  1  UN5B_XENLA   Q8jgt4 xenopus lae
    9   2563.5   55.0    945  1  UN5B_MOUSE   Q8k1s3 mus musculu
   10   2560.5   54.9    945  1  UN5B_RAT     O08722 rattus norv
   11   2541.5   54.5    945  1  UN5B_HUMAN   Q8izj1 homo sapien
   12   2185.5   46.9    953  1  UN5D_HUMAN   Q6uxz4 homo sapien
   13     2183   46.8    956  1  UN5D_MOUSE   Q8k1s2 mus musculu
   14   1050.5   22.5    876  2  Q7PW78       Q7pw78 anopheles g
   15      992   21.3   1072  1  UNC5_DROME   Q95tu8 drosophila
   16  [illegible] [illegible] [illegible]  1  [illegible]  Q262ol caenorhabdi
   17  [illegible] [illegible] [illegible]  2  [illegible]  [illegible] anopheles g
   18  [illegible] [illegible] [illegible]  1  [illegible]  [illegible] petromyzon
   19  [illegible] [illegible] [illegible]  2  Q96SC3       Q96sc3 homo sapien
   20  [illegible] [illegible] [illegible]  2  Q96RW7       Q96rw7 homo sapien
   21    370.5    8.0   1244  2  Q69YJ3       Q69yj3 homo sapien
   22      333    7.1   1388  2  Q7QKD0       Q7qkd0 anopheles g
   23      318    6.8    325  2  Q8I1K1       Q8i1k1 drosophila
   24      300    6.4    518  2  Q8IV45       Q8iv45 homo sapien
   25    298.5    6.4   1584  1  BAI1_HUMAN   O14514 homo sapien
   26    296.5    6.4   1172  1  TSP2_HUMAN   P35442 homo sapien
   27      293    6.3   1074  1  SM5A_HUMAN   Q13591 homo sapien
   28      293    6.3   1172  1  TSP2_MOUSE   Q03350 mus musculu
   29      293    6.3   1172  2  Q7TMT3       Q7tmt3 mus musculu
   30      293    6.3   1172  2  Q8CG21       Q8cg21 mus musculu
   31      292    6.3   1582  2  Q8CGM0       Q8cgm0 mus musculu
   32    291.5    6.3   1170  1  TSP2_BOVIN   Q95116 bos taurus
   33      291    6.2   1077  1  SM5A_MOUSE   Q62217 mus musculu
   34    288.5    6.2   1088  2  Q6PCK8       Q6pck8 xenopus lae
   35      286    6.1   1081  2  Q9U631       Q9u631 drosophila
   36      285    6.1   1091  2  Q7YU67       Q7yu67 drosophila
   37      285    6.1   1093  2  Q9VTT0       Q9vtt0 drosophila
   38      284    6.1    518  2  Q6R653       Q6r653 mus musculu
   39      280    6.0   1173  1  TSP1_XENLA   P35448 xenopus lae
   40      276    5.9   1093  1  SM5B_HUMAN   Q9p283 homo sapien
   41      276    5.9   1151  2  Q6DD89       Q6dd89 homo sapien
   42      276    5.9   1461  2  Q8MYA8       Q8mya8 caenorhabdi
   43    275.5    5.9    632  2  Q6ZPQ8       Q6zpq8 mus musculu
   44    275.5    5.9   1093  1  SM5B_MOUSE   Q60519 mus musculu
   45    275.5    5.9   1122  2  Q7TT33       Q7tt33 mus musculu



ALIGNMENTS 



RESULT 1 
UN5A_MOUSE

ID UN5A_MOUSE STANDARD; PRT; 898 AA.
AC Q8K1S4; Q6PEF7; Q80T71;
DT 25-OCT-2004 (Rel. 45, Created)
DT 25-OCT-2004 (Rel. 45, Last sequence update)
DT 25-OCT-2004 (Rel. 45, Last annotation update)
DE Netrin receptor UNC5A precursor (Unc-5 homolog A) (Unc-5 homolog 1).
GN Name=Unc5a; Synonyms=Kiaa1976, Unc5h1;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY.
RX MEDLINE=22239710; PubMed=12351186; DOI=10.1016/S0925-4773(02)00248-4;
RA Engelkamp D.;
RT "Cloning of three mouse unc-5 genes and their expression patterns at
RT mid-gestation.";
RL Mech. Dev. 118:191-197(2002).
RN [2]
RP SEQUENCE FROM N.A. (ISOFORM 3).
RC TISSUE=Brain;
RX MEDLINE=22579291; PubMed=12693553;
RA Okazaki N., Kikuno R., Ohara R., Inamoto S., Aizawa H., Yuasa S.,
RA Nakajima D., Nagase T., Ohara O., Koga H.;
RT "Prediction of the coding sequences of mouse homologues of KIAA gene:
RT II. The complete nucleotide sequences of 400 mouse KIAA-homologous
RT cDNAs identified by screening of terminal sequences of cDNA clones
RT randomly sampled from size-fractionated libraries.";
RL DNA Res. 10:35-48(2003).
RN [3]
RP SEQUENCE FROM N.A. (ISOFORM 2).
RC STRAIN=C57BL/6; TISSUE=Brain;
RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates
CC axon repulsion of neuronal growth cones in the developing nervous
CC system upon ligand binding. Axon repulsion in growth cones may be
CC caused by its association with DCC that may trigger signaling for
CC repulsion. It also acts as a dependence receptor required for
CC apoptosis induction when not associated with netrin ligand (By
CC similarity).
CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts
CC with MAGED1. Interacts with PRKCABP, possibly mediating some
CC interaction with PKC (By similarity).
CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction
CC with PRKCABP regulates its surface expression and leads to its
CC removal from surface of neurons and growth cones (By similarity).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q8K1S4-1; Sequence=Displayed;
CC Name=2;
CC IsoId=Q8K1S4-2; Sequence=VSP_011697;
CC Note=No experimental confirmation available;
CC Name=3;
CC IsoId=Q8K1S4-3; Sequence=VSP_011696;
CC Note=No experimental confirmation available;
CC -!- TISSUE SPECIFICITY: Restricted to central nervous system.
CC -!- DOMAIN: The ZU5 domain mediates the interaction with MAGED1, which
CC participates in the induction of apoptosis (By similarity).
CC -!- PTM: Phosphorylated by PKC in vitro. Phosphorylated on cytoplasmic
CC tyrosine residues (By similarity).
CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The
CC cleavage does not take place when the receptor is associated with
CC netrin ligand. Its cleavage by caspases is required to induce
CC apoptosis (By similarity).
CC -!- SIMILARITY: Belongs to the UNC-5 family.
CC -!- SIMILARITY: Contains 1 death domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain.
CC -!- SIMILARITY: Contains 2 TSP type-1 domains.
CC -!- SIMILARITY: Contains 1 ZU5 domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC

DR EMBL; AJ487852; CAD32250.1; -.
DR EMBL; AK122575; BAC65857.1; ALT_INIT.
DR EMBL; BC058084; AAH58084.1; -.
DR HSSP; P07996; 1LSL.
DR MGD; MGI:894682; Unc5a.
DR InterPro; IPR000488; Death.
DR InterPro; IPR011029; DEATH_like.
DR InterPro; IPR003599; Ig.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR InterPro; IPR000906; ZU5.
DR Pfam; PF00531; Death; 1.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF00090; TSP_1; 2.
DR Pfam; PF00791; ZU5; 1.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00005; DEATH; 1.
DR SMART; SM00409; IG; 1.
DR SMART; SM00209; TSP1; 2.
DR SMART; SM00218; ZU5; 1.
DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50092; TSP1; 2.
KW Alternative splicing; Apoptosis; Developmental protein;
KW Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal;
KW Transmembrane.

FT SIGNAL 1 25 Potential.
FT CHAIN 26 898 Netrin receptor UNC5A.
FT DOMAIN 26 361 Extracellular (Potential).
FT TRANSMEM 362 382 Potential.
FT DOMAIN 383 898 Cytoplasmic (Potential).
FT DOMAIN 44 141 Ig-like.
FT DOMAIN 155 234 Ig-like C2-type.
FT DOMAIN [illegible] [illegible] TSP type-1 1.
FT DOMAIN [illegible] [illegible] TSP type-1 2.
FT DOMAIN [illegible] [illegible] ZU5.
FT DOMAIN [illegible] [illegible] Death.
FT SITE 396 397 Cleavage (by caspase-3) (By similarity).
FT SITE 661 679 Interaction with DCC (By similarity).
FT DISULFID 65 124 By similarity.
FT DISULFID 170 221 By similarity.
FT CARBOHYD 107 107 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 218 218 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 343 343 N-linked (GlcNAc...) (Potential).
FT VARSPLIC 1 790 Missing (in isoform 3).
FT /FTId=VSP_011696.
FT VARSPLIC 241 296 Missing (in isoform 2).
FT /FTId=VSP_011697.
FT CONFLICT 217 217 A -> P (in Ref. 3).
SQ SEQUENCE 898 AA; 98856 MW; 59F04BA2E196C1DB CRC64;



Query Match 97.8%; Score 4558; DB 1; Length 898; 

Best Local Similarity 96.7%; Pred. No. 0; 

Matches 844; Conservative 19; Mismatches 10; Indels 0; Gaps 0; 



Qy 1 QQSATVANPVPGANPDLLPHFLV^PEDWIVTCNKPVXLVCKAVPATQIFFKCNGEWVRQV 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 26 QQSATVANPVPGANPDLLPHFLv^PEDVTIVT^NKPV^LVCKAVPATQIFFKCNGEWVRQV 85 

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120 

I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 86 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145 

Qy 121 RLRKNFEQEPLAKEVSLEQGIvXPCRPPEGIPPAEVEWLRNEDLVI)PSLDP^^^ITREHS 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 6 YLRKNFEQEPLAKEVSLEQGI VTjPCRPPEGIPPAEV^WLRNEDLVDPSLDPNVYITREHS 205 

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAV^ 240 

I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 206 LVWQARIJUDTANYTCVAIO^IVARRRSASAAVIvTWGGWSTWTEWSVCSASCGRGWQK^ 265 

Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 266 SRSCTNPAPIJ^GGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 325 

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360 

111111111:1 I I I I I II I I I I I : I : : I I I I I II II : I I : I I I I I I : I I I I I I : I : I I I 
Db 326 APRNGGEECRGADLDTRNCTSDLCLHTSSGPEDVALYIGLVAVAVCLILLLLVLVLIYCR 385 

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 445 

Qy 421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480 

I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I II I I I I I I I I I I MINI 
Db 446 PKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNF 505 

Qy 481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 565 



Qy 541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600

1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 Ml 
Db 566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGACY 625

Qy 601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 660

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 II 1 1 II 1 1 II 1 1 
Db 626 VFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 685

Qy 661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720

1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 : 1 1 1 
Db 686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQQYLH 745

Qy 721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780

1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 : 1 II 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 746 CTFTLERWASTSDIACKVWWQVEGDGQSFNINFNITKDTRFAEMIALESEGGVPALVG 805

Qy 781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 806 PSAFKIPFLIRQKIITSLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy 841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
Db 866 RHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898





RESULT 2
UN5A_RAT

ID UN5A_RAT STANDARD; PRT; 898 AA.
AC O08721;
DT 25-OCT-2004 (Rel. 45, Created)
DT 25-OCT-2004 (Rel. 45, Last sequence update)
DT 25-OCT-2004 (Rel. 45, Last annotation update)
DE Netrin receptor UNC5A precursor (Unc-5 homolog A) (Unc-5 homolog 1).
GN Name=Unc5a; Synonyms=Unc5h1;
OS Rattus norvegicus (Rat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
OX NCBI_TaxID=10116;
RN [1]
RP SEQUENCE FROM N.A., FUNCTION, SUBCELLULAR LOCATION, AND TISSUE
RP SPECIFICITY.
RC TISSUE=Ventral spinal cord;
RX MEDLINE=97271897; PubMed=9126742;
RA Leonardo E.D., Hinck L., Masu M., Keino-Masu K., Ackerman S.L.,
RA Tessier-Lavigne M.;
RT "Vertebrate homologues of C. elegans UNC-5 are candidate netrin
RT receptors.";
RL Nature 386:833-838(1997).
RN [2]
RP FUNCTION, AND INTERACTION WITH DCC.
RX PubMed=10399920;
RA Hong K., Hinck L., Nishiyama M., Poo M.-M., Tessier-Lavigne M.,
RA Stein E.;
RT "A ligand-gated association between cytoplasmic domains of UNC5 and
RT DCC family receptors converts netrin-induced growth cone attraction to
RT repulsion.";





RL Cell 97:927-941(1999).
RN [3]
RP TISSUE SPECIFICITY.
RX PubMed=11472849;
RA Barrett C., Guthrie S.;
RT "Expression patterns of the netrin receptor UNC5H1 among developing
RT motor neurons in the embryonic rat hindbrain.";
RL Mech. Dev. 106:163-166(2001).
RN [4]
RP FUNCTION.
RX PubMed=11387206; DOI=10.1093/emboj/20.11.2715;
RA Llambi F., Causeret F., Bloch-Gallego E., Mehlen P.;
RT "Netrin-1 acts as a survival factor via its receptors UNC5H and DCC";
RL EMBO J. 20:2715-2722(2001).
RN [5]
RP FUNCTION, SUBCELLULAR LOCATION, AND INTERACTION WITH MAGED1.
RX PubMed=12598531; DOI=10.1074/jbc.M300415200;
RA Williams M.E., Strickland P., Watanabe K., Hinck L.;
RT "UNC5H1 induces apoptosis via its juxtamembrane region through an
RT interaction with NRAGE.";
RL J. Biol. Chem. 278:17483-17490(2003).
RN [6]
RP INTERACTION WITH PRKCABP, PHOSPHORYLATION, AND MUTAGENESIS OF
RP 896-ALA--CYS-898.
RX PubMed=14672991; DOI=23/36/11279;
RA Williams M.E., Wu S.C.-Y., McKenna W.L., Hinck L.;
RT "Surface expression of the netrin receptor UNC5H1 is regulated through
RT a protein kinase C-interacting protein/protein kinase-dependent
RT mechanism.";
RL J. Neurosci. 23:11279-11288(2003).

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates
CC axon repulsion of neuronal growth cones in the developing nervous
CC system upon ligand binding. Axon repulsion in growth cones may be
CC caused by its association with DCC that may trigger signaling for
CC repulsion. It also acts as a dependence receptor required for
CC apoptosis induction when not associated with netrin ligand.
CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts
CC with MAGED1. Interacts with PRKCABP, possibly mediating some
CC interaction with PKC.
CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction
CC with PRKCABP regulates its surface expression and leads to its
CC removal from surface of neurons and growth cones.
CC -!- TISSUE SPECIFICITY: Mainly expressed in regions of differentiating
CC neurons. Expressed at early stages of neural tube development in
CC the ventral spinal cord. In developing hindbrain, it colocalizes
CC with a number of cranial motor neuron subpopulations from
CC embryonic E11 to E14, while DCC is expressed by motor neurons at
CC E12. Also expressed in non-neural structures, such as the basal
CC plane of the hindbrain and midbrain, in the developing
CC hypothalamus, thalamus and in the pallidium.
CC -!- DOMAIN: The ZU5 domain mediates the interaction with MAGED1, which
CC participates in the induction of apoptosis.
CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By
CC similarity). Phosphorylated by PKC in vitro.
CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The
CC cleavage does not take place when the receptor is associated with
CC netrin ligand. Its cleavage by caspases is required to induce



CC apoptosis.
CC -!- SIMILARITY: Belongs to the UNC-5 family.
CC -!- SIMILARITY: Contains 1 death domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain.
CC -!- SIMILARITY: Contains 2 TSP type-1 domains.
CC -!- SIMILARITY: Contains 1 ZU5 domain.
CC --------------------------------------------------------------------------
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC --------------------------------------------------------------------------
DR EMBL; U87305; AAB57678.1; -.
DR HSSP; P07996; 1LSL.
DR RGD; 621755; Unc5h1.
DR InterPro; IPR000488; Death.
DR InterPro; IPR011029; DEATH_like.
DR InterPro; IPR003599; Ig.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR InterPro; IPR000906; ZU5.
DR Pfam; PF00531; Death; 1.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF00090; TSP_1; 2.
DR Pfam; PF00791; ZU5; 1.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00005; DEATH; 1.
DR SMART; SM00409; IG; 1.
DR SMART; SM00209; TSP1; 2.
DR SMART; SM00218; ZU5; 1.
DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50092; TSP1; 2.
KW Apoptosis; Developmental protein; Immunoglobulin domain;
KW Phosphorylation; Receptor; Repeat; Signal; Transmembrane.



FT SIGNAL 1 25 Potential.
FT CHAIN 26 898 Netrin receptor UNC5A.
FT DOMAIN 26 361 Extracellular (Potential).
FT TRANSMEM 362 382 Potential.
FT DOMAIN 383 898 Cytoplasmic (Potential).
FT DOMAIN 44 141 Ig-like.
FT DOMAIN 155 238 Ig-like C2-type.
FT DOMAIN 242 296 TSP type-1 1.
FT DOMAIN 298 350 TSP type-1 2.
FT DOMAIN 495 598 ZU5.
FT DOMAIN 817 897 Death.
FT SITE 396 397 Cleavage (by caspase-3) (By similarity).
FT SITE 661 679 Interaction with DCC (By similarity).
FT DISULFID 65 124 By similarity.
FT DISULFID 170 221 By similarity.
FT CARBOHYD 107 107 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 218 218 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 343 343 N-linked (GlcNAc...) (Potential).

FT MUTAGEN 896 898 Missing: Abolishes interaction with 

FT PRKCABP. 

SQ SEQUENCE 898 AA; 98840 MW; 7A3CBCB9E7ACA135 CRC64; 



Query Match 96.8%; Score 4511; DB 1; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0;

Matches 838; Conservative 17; Mismatches 18; Indels 0; Gaps 0; 



Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy 61 DHVI ERSTDGS SGLPTME VT^INVSRQQ V^KVFGLEE YWCQCVAWS S S GTTKSQKAYI RI A 120 

I I I I I I I I I I I I I I I I I I I I I I I II t I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I 

Db 86 DHVIERSTDSSSGLPTMEVRINVSRQQV1EKVTGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145 

Qy 121 RLRKNFEQEPIJ^EVSLEQGIvTiPCRPPEGIPPAEV^WLRNEDLVTlPSLDPNVYITREH^ 180 

II I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 146 YLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEV^WLRNEDLVDPSLDPNVYITREHS 205 

Qy 181 LVVRQARLADTANYTCVAKNIVARRRSASAAVI VY 240 

I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 206 LvVRQARLADTANYTCv7UOtfIVARRRSTSAAVIVYW 265 

Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSSWSKWSACGLDCTHWRSRECSDP 325 

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360 

111111111:1 I I I I I I I I I I I I : I : I I I I I I I I I : I I : I I I I I I I I I I I hill 

Db 326 APRNGGEECRGADLDTRNCTSDLCLHTASCPEDVALYIGLVAVAVCLFLLLLALGLIYCR 385 

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCSRQDGPS 445 

Qy 421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480 

I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 446 PKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVSRLSTQNYFRSLPRGTSNMAYGTFNF 505 

Qy 481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 

Db 506 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPVVSCGPPG 565

Qy 541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I III 

Db 566 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEESPSHLYYCQLEAGACY 625 

Qy 601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 660

I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 626 VFTEQLGRFALVGEALSVAATKRLRLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 685

Qy 661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I 

Db 686 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQQYLH 745 



Qy 721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780 

I I I I I I I : : I I I I I I I I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 746 CTFTLERINASTSDLACKVWVWQVEGDGQSFNINFNITKDTRFAELLALESEGGVPALVG 805

Qy 781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840

I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II 

Db 806 PSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 865

Qy 841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873 

II I I II II I II I I I I I I I I I I I I I I I I I I I I I 

Db 866 RHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 
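
The percentages in the summary line of a result can be cross-checked from the raw counts printed with it. The report does not state the formulas, so the sketch below rests on two assumptions: that Best Local Similarity is the number of matches divided by the number of aligned columns (matches + conservative + mismatches + indels), and that Query Match is the score divided by the perfect score of 4660 listed in the report header. The class and function names are illustrative and are not part of GenCore.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AlignmentCounts:
    """Column counts as printed on the 'Matches ...; Conservative ...' line."""
    matches: int
    conservative: int
    mismatches: int
    indels: int

    @property
    def columns(self) -> int:
        # Assumed definition: every alignment column falls in exactly one class.
        return self.matches + self.conservative + self.mismatches + self.indels


def best_local_similarity(counts: AlignmentCounts) -> float:
    """Assumed formula: identical columns as a percentage of all columns."""
    return 100.0 * counts.matches / counts.columns


def query_match(score: float, perfect_score: float) -> float:
    """Assumed formula: alignment score as a percentage of the self-match score."""
    return 100.0 * score / perfect_score


PERFECT_SCORE = 4660  # self-match score of the query, from the report header

# Counts and score printed for RESULT 2 (UN5A_RAT).
rat = AlignmentCounts(matches=838, conservative=17, mismatches=18, indels=0)
print(f"Best Local Similarity: {best_local_similarity(rat):.1f}%")
print(f"Query Match: {query_match(4511, PERFECT_SCORE):.1f}%")
```

Under these assumptions the rat counts give 873 aligned columns, which equals the query length, and the two printed percentages (96.0% and 96.8%) are reproduced.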



RESULT 3 
UN5A_HUMAN 

ID UN5A_HUMAN STANDARD; PRT; 842 AA. 

AC Q6ZN44; Q8TF26; Q96GP4; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5A precursor (Unc-5 homolog A) (Unc-5 homolog 1).
GN Name=UNC5A; Synonyms=KIAA1976, UNC5H1;
OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Brain; 

RX PubMed=14702039; DOI=10.1038/ng1285;
RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,
RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,
RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,
RA Yamamoto J.-I., Saito K., Kawai Y., Isono Y., Nakamura Y.,
RA Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M.,
RA Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H.,
RA Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E.,
RA Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M.,
RA Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K.,
RA Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y.,
RA Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T.,
RA Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O.,
RA Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M.,
RA Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S.,
RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,
RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,
RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,
RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,
RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y.,
RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T.,
RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y.,
RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S.,
RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T.,
RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M.,
RA Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T.,
RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K.,
RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R.,
RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.;
RT "Complete sequencing and characterization of 21,243 full-length human
RT cDNAs.";

RL Nat. Genet. 36:40-45(2004). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 3), AND SEQUENCE OF 302-842 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;
RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,
RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,
RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,
RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,
RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,
RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,
RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,
RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,
RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,
RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,
RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,
RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,
RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,
RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,
RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,
RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,
RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;
RT "Generation and initial analysis of more than 15,000 full-length human
RT and mouse cDNA sequences.";
RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002).

RN [3] 

RP SEQUENCE OF 624-728 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=21842142; PubMed=11853319;
RA Nagase T., Kikuno R., Ohara O.;

RT "Prediction of the coding sequences of unidentified human genes. XXII. 

RT The complete sequences of 50 new cDNA clones which code for large 

RT proteins."; 

RL DNA Res. 8:319-327(2001). 

RN [4] 

RP INDUCTION. 

RX PubMed=12598906; DOI=10.1038/ncb943;
RA Tanikawa C., Matsuda K., Fukuda S., Nakamura Y., Arakawa H.;
RT "p53RDL1 regulates p53-dependent apoptosis.";
RL Nat. Cell Biol. 5:216-223(2003).
RN [5]
RP DOWN-REGULATION IN CANCER.
RX PubMed=12655055; DOI=10.1073/pnas.0738063100;
RA Thiebault K., Mazelin L., Pays L., Llambi F., Joly M.-O.,
RA Scoazec J.-Y., Saurin J.-C., Romeo G., Mehlen P.;

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates
CC axon repulsion of neuronal growth cones in the developing nervous
CC system upon ligand binding. Axon repulsion in growth cones may be
CC caused by its association with DCC that may trigger signaling for
CC repulsion. It also acts as a dependence receptor required for
CC apoptosis induction when not associated with netrin ligand.
CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts
CC with MAGED1. Interacts with PRKCABP, possibly mediating some
CC interaction with PKC (By similarity).
CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction
CC with PRKCABP regulates its surface expression and leads to its
CC removal from surface of neurons and growth cones (By similarity).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=3;
CC Name=1;
CC IsoId=Q6ZN44-1; Sequence=Displayed;
CC Note=No experimental confirmation available;
CC Name=2;
CC IsoId=Q6ZN44-2; Sequence=VSP_011694, VSP_011695;
CC Note=No experimental confirmation available;
CC Name=3;
CC IsoId=Q6ZN44-3; Sequence=VSP_011693;
CC Note=No experimental confirmation available;
CC -!- INDUCTION: By p53/TP53.
CC -!- DOMAIN: The ZU5 domain mediates the interaction with MAGED1, which
CC participates in the induction of apoptosis (By similarity).
CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues.
CC Phosphorylated by PKC in vitro (By similarity).
CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The
CC cleavage does not take place when the receptor is associated with
CC netrin ligand. Its cleavage by caspases is required to induce
CC apoptosis (By similarity).
CC -!- MISCELLANEOUS: Down-regulated in multiple cancers including
CC colorectal, breast, ovary, uterus, stomach, lung, or kidney
CC cancers.
CC -!- SIMILARITY: Belongs to the UNC-5 family.
CC -!- SIMILARITY: Contains 1 death domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain.
CC -!- SIMILARITY: Contains 1 TSP type-1 domain.
CC -!- SIMILARITY: Contains 1 ZU5 domain.
CC -!- CAUTION: Ref.3 sequence differs from that shown due to the
CC presence of introns.
CC --------------------------------------------------------------------------
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC --------------------------------------------------------------------------

DR EMBL; AK131380; BAD18531.1; -.
DR EMBL; BC009333; AAH09333.2; -.
DR EMBL; BC033727; -; NOT_ANNOTATED_CDS.
DR EMBL; AB075856; BAB85562.1; ALT_SEQ.
DR Genew; HGNC:12567; UNC5A.
DR MIM; 607869; -.
DR InterPro; IPR003599; Ig.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR000884; TSP1.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF00090; TSP_1; 1.
DR SMART; SM00409; IG; 1.
DR SMART; SM00209; TSP1; 1.
DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50092; TSP1; 1.
KW Alternative splicing; Apoptosis; Developmental protein;
KW Immunoglobulin domain; Phosphorylation; Receptor; Signal;
KW Transmembrane.








FT SIGNAL 1 25 Potential.
FT CHAIN 26 842 Netrin receptor UNC5A.
FT DOMAIN 1 306 Extracellular (Potential).
FT TRANSMEM 307 327 Potential.
FT DOMAIN 328 842 Cytoplasmic (Potential).
FT DOMAIN 44 141 Ig-like.
FT DOMAIN 155 234 Ig-like C2-type.
FT DOMAIN 242 294 TSP type-1.
FT DOMAIN 439 542 ZU5.
FT DOMAIN 761 841 Death.
FT SITE 340 341 Cleavage (by caspase-3) (By similarity).
FT SITE 605 623 Interaction with DCC (By similarity).
FT DISULFID 65 124 By similarity.
FT DISULFID 170 221 By similarity.
FT CARBOHYD 107 107 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 218 218 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 287 287 N-linked (GlcNAc...) (Potential).
FT VARSPLIC 1 97 MAVRPGLWPALLGIVLAAWLRGSGAQQSATVANPVPGANPD
FT LLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWV
FT RQVDHVIERSTDGSN -> MAGTSERSLISSISQPKAIECF
FT EVKKKAFLTHGRYHGSGATPPKTKDPKPETFCGQT (in
FT isoform 3).
FT /FTId=VSP_011693.
FT VARSPLIC 296 301 TASGPE -> SESSLP (in isoform 2).
FT /FTId=VSP_011694.
FT VARSPLIC 302 842 Missing (in isoform 2).
FT /FTId=VSP_011695.
SQ SEQUENCE 842 AA; 92958 MW; 3DFADCF973131849 CRC64;



Query Match 91.8%; Score 4279; DB 1; Length 842; 

Best Local Similarity 93.2%; Pred. No. 1.4e-313; 

Matches 814; Conservative 2; Mismatches 1; Indels 56; Gaps 1; 



Qy 1 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 60

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 26 QQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQV 85

Qy 61 DHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120

I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 86 DHVIERSTDGSNGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIA 145

Qy 121 RLRKNFEQEPLAKEVSLEQGIVIjPCRPPEGIPPAEVTIWLRNEDLVBPSLDPNVYITREHS 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 146 YLRKNFEQEPLAKEVSLEQGIVXPCRPPEGIPPAEv^WLRNEDLVIDPSLDPNVYITREHS 205 

Qy 181 LVVRQARLADTANYTCVAKNIVARRRS 240

I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I II I 
Db 206 LVVRQARLADTANYTCVAKNIVARRRSASAAVIVY 240



Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300
                                      |||||||||||||||||||||||||||||
Db 241 -------------------------------VDGSWSPWSKWSACGLDCTHWRSRECSDP 269



Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 360 

I I I I I I I I I I I it I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I t I I I I I 
Db 270 APRNGGEECQGTDLDTRNCTSDLCVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCR 329 

Qy 361 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 330 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 389 

Qy 421 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 1 I I I I 
Db 390 PKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNF 449 

Qy 481 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 540 

I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 450 LGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPG 509 

Qy 541 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 510 VLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACY 569 

Qy 601 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 660

II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 570 VFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLE 629

Qy 661 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 720 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 630 KQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLH 689

Qy 721 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 780

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 690 CTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVG 749

Qy 781 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 840

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 750 PSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEA 809

Qy 841 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 873

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 810 RHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 842
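
The feature-table (FT) lines of the Swiss-Prot entries in this report carry a feature key, a start and an end position, and a free-text description; continuation lines repeat the FT code but omit the key and positions. A minimal Python sketch for reading such lines follows. It assumes one feature per line with whitespace-separated fields, as in `FT TRANSMEM 307 327 Potential.`, and the names `Feature` and `parse_features` are hypothetical helpers introduced here, not part of any Swiss-Prot tooling.

```python
import re
from typing import Iterable, List, NamedTuple


class Feature(NamedTuple):
    key: str
    start: int
    end: int
    description: str


# A feature line: "FT", an upper-case key, two integer positions, optional text.
_FT_HEAD = re.compile(r"^FT\s+([A-Z_]+)\s+(\d+)\s+(\d+)\s*(.*)$")
# A continuation line: "FT" followed only by more description text.
_FT_CONT = re.compile(r"^FT\s+(.*)$")


def parse_features(lines: Iterable[str]) -> List[Feature]:
    """Collect FT lines into Feature records, folding continuation lines
    into the description of the preceding feature."""
    features: List[Feature] = []
    for raw in lines:
        line = raw.rstrip()
        head = _FT_HEAD.match(line)
        if head:
            key, start, end, text = head.groups()
            features.append(Feature(key, int(start), int(end), text.strip()))
            continue
        cont = _FT_CONT.match(line)
        if cont and features:
            last = features[-1]
            joined = f"{last.description} {cont.group(1).strip()}".strip()
            features[-1] = last._replace(description=joined)
    return features


# Lines taken from the UN5A_HUMAN feature table.
sample = [
    "FT TRANSMEM 307 327 Potential.",
    "FT DOMAIN 439 542 ZU5.",
    "FT VARSPLIC 296 301 TASGPE -> SESSLP (in isoform 2).",
    "FT /FTId=VSP_011694.",
]
features = parse_features(sample)
for f in features:
    print(f.key, f.start, f.end, f.end - f.start + 1, f.description)
```

Keeping positions as integers makes it straightforward to compare a feature span with the database coordinates printed on the `Db` rows of an alignment.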



RESULT 4
UN5C_MOUSE
ID UN5C_MOUSE STANDARD; PRT; 931 AA.
AC O08747; Q8CD16;
DT 25-OCT-2004 (Rel. 45, Created)
DT 25-OCT-2004 (Rel. 45, Last sequence update)
DT 25-OCT-2004 (Rel. 45, Last annotation update)
DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3)
DE (Rostral cerebellar malformation protein).
GN Name=Unc5c; Synonyms=Rcm, Unc5h3;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
OX NCBI_TaxID=10090;

RN [1]
RP SEQUENCE FROM N.A. (ISOFORM 1), FUNCTION, DISEASE, AND TISSUE
RP SPECIFICITY.
RC STRAIN=C57B6/SJL;
RX MEDLINE=97271898; PubMed=9126743;
RA Ackerman S.L., Kozak L.P., Przyborski S.A., Rund L.A., Boyer B.B.,
RA Knowles B.B.;
RT "The mouse rostral cerebellar malformation gene encodes an UNC-5-like
RT protein.";
RL Nature 386:838-842(1997).
RN [2]
RP SEQUENCE FROM N.A. (ISOFORM 2).
RC STRAIN=C57BL/6J; TISSUE=Testis;
RX MEDLINE=22354683; PubMed=12466851; DOI=10.1038/nature01266;
RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,
RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,
RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,
RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,
RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W.,
RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,
RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S.,
RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,
RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D.,
RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,
RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,
RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H.,
RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,
RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,
RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M.,
RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,
RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,
RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,
RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,
RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,
RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,
RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,
RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,
RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,
RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,
RA Birney E., Hayashizaki Y.;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs.";
RL Nature 420:563-573(2002).
RN [3]
RP FUNCTION, AND TISSUE SPECIFICITY.
RX PubMed=9389662;
RA Przyborski S.A., Knowles B.B., Ackerman S.L.;
RT "Embryonic phenotype of Unc5h3 mutant mice suggests chemorepulsion
RT during the formation of the rostral cerebellar boundary.";
RL Development 125:41-50(1998).
RN [4]
RP INTERACTION WITH DCC.
RX PubMed=10399920;
RA Hong K., Hinck L., Nishiyama M., Poo M.-M., Tessier-Lavigne M.,
RA Stein E.;
RT "A ligand-gated association between cytoplasmic domains of UNC5 and
RT DCC family receptors converts netrin-induced growth cone attraction to
RT repulsion.";
RL Cell 97:927-941(1999).
RN [5]
RP PHOSPHORYLATION SITE TYR-568, AND MUTAGENESIS OF TYR-568.
RX PubMed=11533026; DOI=10.1074/jbc.M103872200;
RA Tong J., Killeen M., Steven R., Binns K.L., Culotti J., Pawson T.;
RT "Netrin stimulates tyrosine phosphorylation of the UNC-5 family of
RT netrin receptors and induces Shp2 binding to the RCM cytodomain.";
RL J. Biol. Chem. 276:40917-40925(2001).
RN [6]
RP FUNCTION.
RX PubMed=12451134; DOI=22/23/10346;
RA Finger J.H., Bronson R.T., Harris B., Johnson K., Przyborski S.A.,
RA Ackerman S.L.;
RT "The netrin 1 receptors Unc5h3 and Dcc are necessary at multiple
RT choice points for the guidance of corticospinal tract axons.";
RL J. Neurosci. 22:10346-10356(2002).

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates
CC axon repulsion of neuronal growth cones in the developing nervous
CC system upon ligand binding. Axon repulsion in growth cones may be
CC caused by its association with DCC that may trigger signaling for
CC repulsion. Also involved in corticospinal tract axon guidance
CC independently of DCC. It also acts as a dependence receptor
CC required for apoptosis induction when not associated with netrin
CC ligand.
CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC.
CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity).
CC -!- ALTERNATIVE PRODUCTS:
CC Event=Alternative splicing; Named isoforms=2;
CC Name=1;
CC IsoId=O08747-1; Sequence=Displayed;
CC Name=2;
CC IsoId=O08747-2; Sequence=VSP_011702;
CC -!- TISSUE SPECIFICITY: Mainly expressed in regions of differentiating
CC neurons. Highly expressed in brain and lung. Weakly expressed in
CC testis, ovary, spleen, thymus and bladder. Expressed at very low
CC level in kidney, intestine and salivary gland.
CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues.
CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11
CC phosphatase, suggesting that its activity is regulated by
CC phosphorylation/dephosphorylation. Tyrosine phosphorylation is
CC netrin-dependent.
CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The
CC cleavage does not take place when the receptor is associated with
CC netrin ligand. Its cleavage by caspases is required to induce
CC apoptosis (By similarity).
CC -!- DISEASE: Defects in Unc5c are the cause of rostral cerebellar
CC malformation (Rcm). Rcm is characterized by cerebellar and
CC midbrain defects, apparently as a result of abnormal neuronal
CC migration.
CC -!- SIMILARITY: Belongs to the UNC-5 family.
CC -!- SIMILARITY: Contains 1 death domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain.
CC -!- SIMILARITY: Contains 2 TSP type-1 domains.



CC -!- SIMILARITY: Contains 1 ZU5 domain.
CC --------------------------------------------------------------------------
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC --------------------------------------------------------------------------
DR EMBL; U72634; AAB54103.1; -.
DR EMBL; AK031655; BAC27495.1; -.
DR HSSP; P07996; 1LSL.
DR MGD; MGI:1095412; Unc5c.
DR GO; GO:0005886; C:plasma membrane; IC.
DR GO; GO:0005042; F:netrin receptor activity; IDA.
DR GO; GO:0005515; F:protein binding; IDA.
DR GO; GO:0007420; P:brain development; IMP.
DR GO; GO:0030334; P:regulation of cell migration; IMP.
DR InterPro; IPR000488; Death.
DR InterPro; IPR011029; DEATH_like.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003598; Ig_c2.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR InterPro; IPR000906; ZU5.
DR Pfam; PF00531; Death; 1.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF00090; TSP_1; 2.
DR Pfam; PF00791; ZU5; 1.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00005; DEATH; 1.
DR SMART; SM00408; IGc2; 1.
DR SMART; SM00209; TSP1; 2.
DR SMART; SM00218; ZU5; 1.
DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50092; TSP1; 2.
KW Alternative splicing; Apoptosis; Developmental protein;
KW Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal;
KW Transmembrane.






FT SIGNAL 1 40 Potential.
FT CHAIN 41 931 Netrin receptor UNC5C.
FT DOMAIN 41 380 Extracellular (Potential).
FT TRANSMEM 381 401 Potential.
FT DOMAIN 402 931 Cytoplasmic (Potential).
FT DOMAIN 62 159 Ig-like.
FT DOMAIN 161 256 Ig-like C2-type.
FT DOMAIN 260 314 TSP type-1 1.
FT DOMAIN 316 368 TSP type-1 2.
FT DOMAIN 528 631 ZU5.
FT DOMAIN 850 929 Death.
FT SITE 415 416 Cleavage (by caspase-3) (By similarity).
FT SITE 694 712 Interaction with DCC (By similarity).
FT DISULFID 83 142 By similarity.
FT DISULFID 188 239 By similarity.
FT MOD_RES 568 568 Phosphotyrosine.



FT CARBOHYD 236 236 N-linked (GlcNAc...) (Potential).

FT CARBOHYD 361 361 N-linked (GlcNAc...) (Potential).

FT VARSPLIC 370 370 A -> GFIYPISTEHRPQNEYGFSS (in isoform 2). 

FT /FTId=VSP_011702. 

FT MUTAGEN 568 568 Y->F: Abolishes interaction with PTPN11, 

FT leading to an increased level of

FT phosphorylation. 

FT CONFLICT 16 16 L -> I (in Ref. 2). 

FT CONFLICT 733 733 H -> R (in Ref. 2). 

FT CONFLICT 924 924 S -> Y (in Ref. 2). 

SQ SEQUENCE 931 AA; 103062 MW; 8A5D951A4EECA179 CRC64; 

Query Match 59.6%; Score 2778.5; DB 1; Length 931; 

Best Local Similarity 58.2%; Pred. No. 2.2e-200; 

Matches 513; Conservative 150; Mismatches 195; Indels 23; Gaps 7; 

Qy 11 PGANPDLLPHFLVEPEDWIVT<NKPVXLVCKAVPATQIFFKCNGEWV^QVT)HVIERSTD^ 70 

I I : I I I I I : I I I : I I I I I I I I I I I I I I I I I : I I I I I I I I I I I : : I 
Db 54 PSDPPEPLPHFLIEPEEAYIVl^KPWLYCKASPATQIYFKCNSEWVTiQKDHVVDERVI)E 113 

Qy 71 SSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

: I I I II I : I I I I I I : : I I . I : I I I I I I I I I I : I I I I I : I I I : I I I III I I I I I 
Db 114 TSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRIAYLRKTFEQEP 173 

Qy 131 LAKEVSLEQGIVXPCRPPEGIPPAEVEWLRNEDLVT)PS 190 

I I I I I II I : : I I I I I I I I I I I I I I I : I I I : : I I : I I III : I : I : : : I I i I : I 
Db 174 LGKEVSLEQEVLLQCRPPEGI PVAEVEWLKNEDI I DPAEDRNFYITIDHNLI I KQARLSD 233 

Qy 191 TANYTCVAKNIVARRRSASAAVIVTWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250 

I I I I I I I I I I I I I : I : I : I I I I I I I I I I I I I I I I I I I : : I I I I : I I I : I : I I I I I I I 
Db 234 TANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSVCNSRCGRGYQKRTRTCTNPAPL 293 

Qy 251 NGGAFCEGQNVQKTACATLCPVT)GSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

111111111:111 II I I I I I I I I : I I I I I II : I I I I I III: I I I : I I I : : I 
Db 294 NGGAFCEGQSVQKIACTTLCPVDGRWTSWSKWSTCGTECTHWRRRECTAPAPKNGGKDCD 353 

Qy 311 GTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVXLLLVLILVYCRKKEGLDSDV 369 

I I : : I I I I I : : I : I I I I I I I : I I I I I I : : : I : I I : : I I : 

Db 354 GLVXQSKNCTDGLCMQAAPDSDDVALWGIVIAWVCIAITVWALFVTRKNHRDFES^ 413 

Qy 370 ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGH 429 

I I I I I I I I I : I I : : I I I : I I I : : I : I : I II : I I 
Db 414 I DS SALNGGFQPVNI KAARQD LLAVP PDLT SAAAMYRGPVYALHD- VS DKI PMTNS P 469 

Qy 430 LLSPLGGGRHTLHHSS PTSEAEEFVSRLS TQNYF RSLPRGT — S 471 

: I U- : : : : I I I : I I I : I I I I : Mill I 

Db 470 ILDPLPNLKIKVYNSSGAWPQDDLAEFSSKLSPQMTQSLLENEALNLKNQSLARQTDPS 529 

Qy 472 NMTYGTFNFLGGRLMI PNTGI SLLI PPDAIPRGKI YEI YLTLHKPEDVRLPLAGCQTLLS 531 

: I I II I I I I : I I I : I : I I I II I I I : I : : I I : I : I : I : I : : I I : MM: 
Db 530 CTAFGTFNSLGGHLI I PNSGVSLLI PAGAI PQGRVYEMYVTVHRKENMRPPMEDSQTLLT 589 

Qy 532 PIVSCGPPGVXLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYY 591 

1:111111 I I I I I I I I I- : I I : I I : I : : I I I : : I I I I I : : I I ! : I 
Db 590 PWSCGPPGALLTRPVTLTLHHCADPSTEDWKIQLKNQAVQGQWEDWWGEENFTTPCY 649 



Qy 592 CQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHD 651



I I : I II:: II I : I I I I : : : I I I I I I I I : I I : I : I II I : I I I I I I II I 
Db 650 IQLDAEACHI LTENLST YALVGQSTTKAAAKRLKLAI FGPLCCS S LE YS I RVYCLDDTQD 709 

Qy 652 ALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHI 711 

I I I I I : I I I : I : I I I I : : I I : I I I I I MINIMI: I I I I I I I I I I I I I I I I I 
Db 710 ALKEVLQLERQMGGQLLEEPKALHFKGSIHNLRLSIHDIAHSLWKSKLLAKYQEIPFYHI 769 

Qy 712 WNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALES 771 

I : I : I I I I I I I I I I I : I : I : I I I I I I I I I : I I I ■: I : : : : : I I : 
Db 770 WSGSQRNLHCTFTLERLSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDP 829 

Qy 772 EAGVPALVGPSAFKI P FLI RQKI I S S LDP PCRRGADWRTLAQKLHLDSHLS FFAS KPS PT 831 

: : : I I I I I I I II I I : II II I I I II I II I I : I I : I : : I I : I I I I 
Db 830 ASTITTVTGPSAFS I PLPI RQKLCS SLDAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPT 889 

Qy 832 AMI LN LWEARH F PNGN L S Q LAAAVAGL GQ P DAGL FT VS EAE 872 

: I I : I I I I : : I I : I I I I I I I : : I : : : : I : 
Db 890 GVILDLWEAQNFPDGNLSMLAAVLEEMGRHETWSLAAEGQ 930 


RESULT 5 
UN5C_RAT



ID UN5C_RAT STANDARD; PRT; 931 AA. 

AC Q761X5; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3).

GN Name=Unc5c; Synonyms=Unc5h3;

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.

OX NCBI_TaxID=10116;

RN [1]

RP SEQUENCE FROM N.A., AND DISEASE.

RX PubMed=15010202; DOI=10.1016/j.molbrainres.2003.12.003;

RA Kuramoto T., Kuwamura M., Serikawa T.;

RT "Rat neurological mutations cerebellar vermis defect and hobble are 

RT caused by mutations in the netrin-1 receptor gene Unc5h3."; 

RL Brain Res. Mol. Brain Res. 122:103-108(2004). 

RN [2] 

RP FUNCTION. 

RX PubMed=11387206; DOI=10.1093/emboj/20.11.2715;

RA Llambi F., Causeret F., Bloch-Gallego E., Mehlen P.;

RT "Netrin-1 acts as a survival factor via its receptors UNC5H and DCC"; 

RL EMBO J. 20:2715-2722(2001). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates

CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. Also involved in corticospinal tract axon guidances 

CC independently of DCC. It also acts as a dependence receptor 

CC required for apoptosis induction when not associated with netrin

CC ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 

CC similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity).



CC -!- TISSUE SPECIFICITY: Mainly expressed in brain. Also expressed in 

CC kidney. Not expressed in developing or adult lung. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues. 

CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11 

CC phosphatase, suggesting that its activity is regulated by 

CC phosphorylation/dephosphorylation. Tyrosine phosphorylation is 

CC netrin-dependent (By similarity).

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis.

CC -!- DISEASE: Defects in Unc5c are the cause of cerebellar vermis

CC defect (cvd) and hobble (hob) phenotypes. Cvd and hob rats exhibit

CC cerebellar and midbrain defects, possibly as a result of abnormal 

CC neuronal migration, and exhibit laminar structure abnormalities in 

CC the fused cerebellar hemispheres and ectopic cerebellar tissues in 

CC the cerebello-pontine junction. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AB118026; BAD05181.1; -. 

DR RGD; 735109; Unc5c.

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2.

DR InterPro; IPR000884; TSP1.

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2.

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2.

DR SMART; SM00218; ZU5; 1.

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.

DR PROSITE; PS50835; IG_LIKE; 1.

DR PROSITE; PS50092; TSP1; 2.

KW Apoptosis; Developmental protein; Immunoglobulin domain; 

KW Phosphorylation; Receptor; Repeat; Signal; Transmembrane. 
Cleavage (by caspase-3) (By similarity).
Interaction with DCC (By similarity).
83
Phosphotyrosine (By similarity).


FT CARBOHYD 236 236 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 361 361 N-linked (GlcNAc...) (Potential).
SQ SEQUENCE 931 AA; 103134 MW; 25B183A97BCB8401 CRC64;



Query Match 59.3%; Score 2762.5; DB 1; Length 931; 

Best Local Similarity 57.9%; Pred. No. 3.5e-199; 

Matches 510; Conservative 151; Mismatches 197; Indels 23; Gaps 7; 

Qy 11 PGANPDLLPHFLVTSPEDVYIVT^KPVXLVCKAV 70 

I I : 11111:111: I I II I I I I I III I I I I I : I I I I III I III:: I 
Db 54 PSDPPEPLPHFLIEPEEAYIV1<NKPWLYCKASPATQIYFKCNSEWVHQKDHVAADERVDE 113 

Qy 71 SSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

: I I I II I : I I I I I I : : I I I : I I I II I I I I I : I I I I I : I I I : I I I III I I I I I 
Db 114 TSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRIAYLRKTFEQEP 173 

Qy 131 LAKEVSLEQGIVXPCRPPEGIPPAEvTSWLRNEDLVT^ 190 

I I I I I I I I : : I II I I I I I I I I I I I I : I I I :: I I I I III : I : I : : : I I I I : I 
Db 174 LGKEVSLEQEVLLQCRPPEGIPMAEVEWLKNEDIIDPVEDRNFYITIDHNLIIKQARLSD 233 

Qy 191 TANYTCVAKNIVARRRSASAAVIVTWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250 

I I I I I I I I I I I I I : I : I : I I I I I I I I I I I I I INN:: I I I I : I I I : I : I I I I I I I 
Db 234 TANYTCVAKNIVAKRKSTTATVIWWGGWSTWAEWSVCNSRCGRGYQKRTRTCTNPAPL 293 

Qy 251 NGGAFCEGQNVQKTACATLCPVTDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

I I I I I IJ I I : I I I II I I I I I I I I : I I I II II : I I II I III: I I I : I I I : : I 
Db 294 NGGAFCEGQSVQKIACTTLCPVDGRWTSWSKWSTCGTECTHWRRRECTAPAPKNGGKDCD 353 

Qy 311 GTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDV 369 

I I :: I I I I I: : I : I I I I I I I : I I I I I I : : : I : I I : : I : : 

Db 354 GLVXQSKNCTDGLCMQAAPDSDDVALYVGIVIAVTVCLAITVWALFVTRKNHR 413 

Qy 370 ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGH 429 

Ml I I I I I I : I I : : I I I : I I I :: I : I : I II Ml 
Db 414 IDS S ALNGG FQ P VN I KAARQ D LLAVPPDLTSAAAMYRGPVYALHD-VSDKIPMTNSP 469 

Qy 430 LLS PLGGGRHTLHHS S PTSEAEEFVSRLS TQNYF RSLPRGT — S 471 

Mil : ::M I I : I I IM I II : M l I I I 

Db 470 ILDPLPNLKIK\A^SSGAVTPQDDIJ^FSSKLSPQMTQSLLENEALNLKNQSLARQTDPS 529 

Qy 472 NMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTLLS 531 



: I I I I III I : I I I : I : I I I I i I II : I : : I I : I : I : I : I : : I I : INI: 
Db 530 CTAFGTFNSLGGHLI IPNSGVSLLI PAGAI PQGRVYEMYVTVHRKENMRPPMEDSQTLLT 589



Qy 532 PIVSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYY 591 

I : I I I I I I I I I I I I I I I : I I : I : : I : : I I I : : I I I I I : : I I I : I 
Db 590 PWSCGPPGALLTRPVILTLHHCADPNTEDWKIQLKNQAVQGQWEDWWGEENFTTPCY 649 

Qy 592 CQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHD 651 

I I : I II:: II I : I I I I : : : I I I II I I I : I I : I : I I I I : I I I I I I II I 
Db 650 IQLDAEACHILTENLSTYALVGQSTTKAAAKRLKLAI FGPLCCSSLEYSIRVYCLDDTQD 709 

Qy 652 ALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHI 711 

I I I I I : I I I : I : I I I I :: I I : I I I I I MINIMI: || M II II I II M II II 
Db 710 ALKEVLQLERQMGGQLLEEPKALHFKGSIHNLRLSIHDITHSLWKSKLLAKYQEIPFYHI 769 

Qy 712 WNGTQRYLHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINFNITKDTRFAELLALES 771 

I : I : II II I II I I II : I : I : I I M I M I I : I! I : I : : : : : I I : 
Db 770 WSGSQRNLHCTFTLERLSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDP 829 

Qy 772 EAGVPALVGPSAFKI P FLI RQKI I S SLDP PCRRGADWRTLAQKLHLDSHLS FFAS KPS PT 831 

: : : I I I I I I I I M I : I II I I I I II I M M : I I : I : : I I : I M I 
Db 830 ASTITTVTGPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPT 889 

Qy 832 AMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872 

: II : I I I I : : II : I I M I I I : : I : : : : I : 
Db 890 GVILDLWEAQNFPDGNLSMLAAVXEEMGRHETVVSLAAEGQ 930 



RESULT 6 
UN5C_CHICK



ID UN5C_CHICK STANDARD; PRT; 931 AA. 

AC Q7T2Z5; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3) 

DE (cUNC-5H3).

GN Name=UNC5C;

OS Gallus gallus (Chicken).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY.

RX PubMed=12799087; 

RA Guan W., Condic M.L.; 

RT "Characterization of Netrin-1, Neogenin and cUNC-5H3 expression during 

RT chick dorsal root ganglia development."; 

RL Gene Expr. Patterns 3:369-373(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Restricted to proprioceptive neurons. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
CC similarity).



CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 
CC -!- SIMILARITY: Contains 2 TSP type-1 domains.

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AY187310; AAO67275.1; -.

DR InterPro; IPR000488; Death.

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Developmental protein; Immunoglobulin domain; Phosphorylation; 

KW Receptor; Repeat; Signal; Transmembrane. 

FT SIGNAL 1 39 Potential. 

FT CHAIN 40 931 Netrin receptor UNC5C. 

FT DOMAIN 40 380 Extracellular (Potential). 

FT TRANSMEM 381 401 Potential. 

FT DOMAIN 402 931 Cytoplasmic (Potential). 

FT DOMAIN 62 159 Ig-like. 

FT DOMAIN 161 256 Ig-like C2-type. 

FT DOMAIN 260 314 TSP type-1 1. 

FT DOMAIN 316 368 TSP type-1 2. 

FT DOMAIN 528 631 ZU5.

FT DOMAIN 850 929 Death. 

FT DISULFID 83 142 By similarity. 

FT DISULFID 188 239 By similarity. 

FT CARBOHYD 236 236 N-linked (GlcNAc...) (Potential).

FT CARBOHYD 361 361 N-linked (GlcNAc...) (Potential).

SQ SEQUENCE 931 AA; 102906 MW; 1E23A0D84F2E2C62 CRC64;



Query Match 59.0%; Score 2750.5; DB 1; Length 931;

Best Local Similarity 57.8%; Pred. No. 2.8e-198;

Matches 508; Conservative 150; Mismatches 198; Indels 23; Gaps 7;



Qy 11 PGANPDLLPHFLV^PEDWIVXNKPVXLVCKAVPATQIFFKCNGEWVl^QVTDHVIERSTDG 70
I I : I I I I I : I I I : I I I I I I I I I III I I I I I : I I II III I III:: I 
Db 54 PSDPPEPLPHFLIEPEEAYIVKNKPVNLYCKASPATQIYFKCNSEWVHQKDHWDERVDE 113

Qy 71 SSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130
: I I I II I : I I I I I I :: I I I : I I I I I I I I I I : I I I I I : I I I : II I III I I I I I 
Db 114 T SGLI VCEVS I EI SRQQVEELFGPEDYWCQCVAWS SAGTTKS RKAYVRI AYLRKTFEQEP 173

Qy 131 LAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNWITREHSLVVRQARLM 190
I II I I I I I : : I I I I I I I I I II I I I I : I I : : : I I I I III : I : I : : : I I I I : I 
Db 174 LGKEVSLEQEVLLQCRPPEGI PYAEVEWLKNEEVI DPVEDRNFYITI DHNLI I KQARLSD 233

Qy 191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250
I I I I I I I I I I I I I : I : I : I I I I I I I I I I I I I I I I I I : : I I I I : I I I : I : I I I I I I I 
Db 234 TANYTCVAKNIVAKRKSTTATVIVYVNGGWSTWTEWSACNSRCGRGFQKRTRTCTNPAPL 293

Qy 251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECO 310
I I I I I II I I I I I I II I I I I I I I I : I I I I I II : I II I I III: I I I : I I I :: I : 
Db 294 NGGAFCEGQNVQKIACTTLCPVDGKWTSWSKWSTCGTECTHWRRRECTAPAPKNGGKDCE 353

Qy 311 GTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDV 369
I I : : I I I I I : : I : I I I I I I I : I I I I I I : : : I : I I : : I I : 
Db 354 GLVLQSKNCTDGLCMQAAPDSDDVALYVGIVIAVIVCLAISVWALFVYRKNHRDFESDI 413

Qy 370 ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGH 429
I I I I I I I I I : I I : : I I I : I I I : : I : I : I II : I I 
Db 414 IDSSALNGGFQPVNIKAARQD ^LLAVPPDLTSAAAMYRGPVYALHD-VSDKIPMTNSP 469

Qy 430 LLS PLGGGRHTLHHS S PTSEAEEFVSRLS TQNYF RSLPRGT — S 471
till : : : : : I I I : I I : I I I I : : I I I I I 
Db 470 ILDPLPNLKIKVYNTSGAVTPQDELSDFSSKLSPQITQSLLENETLNVKNQSLARQTDPS 529

Qy 472 NMTYGTFNFLGGRLMI PNTGI SLLIPPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTLLS 531
: I I I I III I : I I I : I : I I I I I I : I : I : : I I : I : I : I : I : I I : INI: 
Db 530 CTAFGTFNSLGGHLVI PNSGVSLLIPAGAVPQGRVYEMYVTVHRKEGMRPPVEDSQTLLT 589

Qy 532 PIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYY 591
I : I I I I I I I I I I I I I : I I I I I I : I I :: I I I :: I I I I I :: I I I : I 
Db 590 PWSCGPPGALLTRPVVLTlfflHCAEPNMDDWQIQLKHQAGQGPWEDVVVVGEENFTTPCY 649

Qy 592 CQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHD 651
II: II:: II I : I I I I :::: I I I I I I I I : I I : : I : I I I I : I I I I I I II I 
Db 650 IQLDPEACHILTETLSTYALVGQSITKAAAKRLKLAIFGPLSCSSLEYSIRVYCLDDTQD 709

Qy 652 ALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHI 711
I I I I I : I I I : I : I I I I : : I I : I I I I I MINIMI: I I 1 I I I I I I I II I I I I 
Db 710 ALKEVLQLERQMGGQLLEEPKTLHFKGSTHNLRLSIHDIAHSLWKSKLPAKYQEIPFYHI 769

Qy 712 WNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALES 771
I : I I I I I I I I I I I I I : I : I I I I I I I II : I I I : I : : : : : : : : I 
Db 770 WSGCQRNLHCTFTLERFSLNTLELVCKLCVRQVEGEGQIFQLNCSVSEEPTGIDYPIMDS 829

Qy 772 EAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPT 831
: : I I I : I I I I I M I : I I I I I I I I M M I I I I : I : : I I : I I I I 
Db 830 AGSITTIVGPNAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLKLDRYLNYFATKSSPT 889

Qy 832 AMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSE 870
: I I : I I I I :: M : M I I I M : : I : : : : I 
Db 890 GVT LDLWEAQNFPDGNLSMLAAVLEEMGRHET WS LAAE 928



RESULT 7 
UN5C_HUMAN 

ID UN5C_HUMAN STANDARD; PRT; 931 AA. 

AC O95185; Q8IUT0;

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3).

GN Name=UNC5C; Synonyms=UNC5H3;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1]

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY. 

RC TISSUE=Brain; 

RX MEDLINE=99000841; PubMed=9782087; DOI=10.1006/geno.1998.5425;

RA Ackerman S.L., Knowles B.B.; 

RT "Cloning and mapping of the UNC5C gene to human chromosome 4q21-q23."; 

RL Genomics 52:205-208(1998). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Lung; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [3] 

RP DOWN-REGULATION IN CANCER. 

RX PubMed=12655055; DOI=10.1073/pnas.0738063100;

RA Thiebault K., Mazelin L., Pays L., Llambi F., Joly M.-O.,

RA Scoazec J.-Y., Saurin J.-C., Romeo G., Mehlen P.;

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 



CC caused by its association with DCC that may trigger signaling for 

CC repulsion. Also involved in corticospinal tract axon guidances 

CC independently of DCC. It also acts as a dependence receptor 

CC required for apoptosis induction when not associated with netrin 

CC ligand (By similarity).

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By
CC similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=1;

CC IsoId=O95185-1; Sequence=Displayed;

CC Name=2;

CC IsoId=O95185-2; Sequence=VSP_011700, VSP_011701;

CC -!- TISSUE SPECIFICITY: Mainly expressed in brain. Also expressed in

CC kidney. Not expressed in developing or adult lung. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues. 

CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11 

CC phosphatase, suggesting that its activity is regulated by 

CC phosphorylation/dephosphorylation. Tyrosine phosphorylation is 

CC netrin-dependent (By similarity).

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The

CC cleavage does not take place when the receptor is associated with

CC netrin ligand. Its cleavage by caspases is required to induce

CC apoptosis (By similarity).

CC -!- MISCELLANEOUS: Down-regulated in multiple cancers including

CC colorectal, breast, ovary, uterus, stomach, lung, or kidney 

CC cancers. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AF055634; AAC67491.1; -. 

DR EMBL; BC041156; AAH41156.1; -. 

DR HSSP; P07996; 1LSL. 

DR Genew; HGNC:12569; UNC5C.

DR MIM; 603610; -.

DR GO; GO:0005042; F:netrin receptor activity; TAS.

DR GO; GO:0007411; P:axon guidance; TAS.

DR GO; GO:0007420; P:brain development; TAS.

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1.



DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1.

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1.

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Alternative splicing; Apoptosis; Developmental protein; 

KW Immunoglobulin domain; Phosphorylation; Polymorphism; Receptor; 

KW Repeat; Signal; Transmembrane. 



FT SIGNAL 1 40 Potential.
FT CHAIN 41 931 Netrin receptor UNC5C.
FT DOMAIN 41 380 Extracellular (Potential).
FT TRANSMEM 381 401 Potential.
FT DOMAIN 402 931 Cytoplasmic (Potential).
FT DOMAIN 62 159 Ig-like.
FT DOMAIN 161 256 Ig-like C2-type.
FT DOMAIN 260 314 TSP type-1 1.
FT DOMAIN 316 368 TSP type-1 2.
FT DOMAIN 528 631 ZU5.
FT DOMAIN 850 929 Death.
FT SITE 415 416 Cleavage (by caspase-3) (By similarity).
FT SITE 694 712 Interaction with DCC (By similarity).
FT DISULFID 83 142 By similarity.
FT DISULFID 188 239 By similarity.
FT MOD_RES 568 568 Phosphotyrosine (By similarity).
FT CARBOHYD 236 236 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 361 361 N-linked (GlcNAc...) (Potential).
FT VARSPLIC 370 370 T -> SFIYPISTEQRTQNEYGFSS (in isoform 2).
FT /FTId=VSP_011700.
FT VARSPLIC 579 931 Missing (in isoform 2).
FT /FTId=VSP_011701.
FT VARIANT 37 37 G -> V (in dbSNP:2306715).
FT /FTId=VAR_019731.
FT VARIANT 721 721 T -> M (in dbSNP:2289043).
FT /FTId=VAR_019732.
FT CONFLICT 219 219 T -> I (in Ref. 1).
FT CONFLICT 489 489 S -> T (in Ref. 1).
SQ SEQUENCE 931 AA; 103101 MW; EFD71122C98DABB8 CRC64;



Query Match 59.0%; Score 2747.5; DB 1; Length 931; 

Best Local Similarity 57.3%; Pred. No. 4.8e-198; 

Matches 505; Conservative 151; Mismatches 202; Indels 23; Gaps 7;

Qy 11 PGANPDLLPHFLVEPEDWIVlCNKPvXLVCKAVPATQIFFKCNGEWvT^QVDHVIERSTD^ 70 

I I : I I I I I : I I I : I I I I I II I I III I I I I I : I I I I III I II::: I 
Db 54 PSDPPEPLPHFLIEPEEAYIVl^NKPWLYCKASPATQIYFKCNSEWVHQKDHIVDERVDE 113 



Qy 71 SSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130
: I I I II ! : I I I I I I : : I I I : I I I I I I I I I I : I I I I I : I I I : I I I III I I I I I 
Db 114 TSGLIVREVSIEISRQQVEELFGPEDYWCQCVAWSSAGTTKSRKAYVRIAYLRKTFEQEP 173



Qy 131 LAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPim^ITREHSLVWQARLAD 190 

I I I I I I I I :: I I I I I I I I I I I I I I I : I I I :: I I I I t I I : I : I : : : I I I I : I 
Db 174 LGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPVEDRNFYITIDHNLIIKQARLSD 233 

Qy 191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250 

I I I I I I I I I I I I I : I : I : I I I I I I I I I I I I I I I I I I I : : I I I I : I M : I : I J I I I I I 
Db 234 TANYTCVAKNIVAKRKSTTATVIVYWGGWSTWTEWSVCNSRCGRGYQKRTRTCTNPAPL 293 

Qy 251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

I I I I I I I I I : I I I II I I I I I I I hllllll II : I I I I I III: I I I : I I I :: I 
Db 294 NGGAFCEGQSVQKIACTTLCPVDGRWTPWSKWSTCGTECTHWRRRECTAPAPKNGGKDCD 353 

Qy 311 GTDLDTRNCTSDLCVHSASGPEDVALWGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDV 369 

I I : : I I I I I : : I : I I I I I I I : I I I I I I : : : I : I I : : I I : 

Db 354 GLVLQSKNCTDGLCMQTAPDSDDVALYVGIVIAVIVCLAISVWALFVYRKNHRDFESDI 413 

Qy 370 ADSS I LTSGFQPVS I KPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS PKFQLTNGH 429 

III I llllhll :: I II : III:: hi : I II :|| 
Db 414 I DS SALNGGFQPVNI KAARQD LLAVPPDLTSAAAMYRGPVYALHD-VSDKIPMTNSP 469 

Qy 430 LLS PLGGGRHTLHHS SPTSEAEEFVSRLS TQNYF RSLPRGT — S 471 

: I I I : : : : : I I : I I I : I I I I : HI I I I 

Db 470 ILDPLPNLKIKVYNTSGAVSPQDDLSEFTSKLSPQMTQSLLENEALSLKNQSLARQTDPS 529 

Qy 472 l^TYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLS 531 

: I : I I III I :: I I : I : I I I I I I I I : I : : I I : I : I : I : I : I I : MM: 
Db 530 CTAFGSFNSLGGHLIVPNSGVSLLIPAGAIPQGRVYEMYVTVHRKETMRPPMDDSQTLLT 589 

Qy 532 PIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYY 591 

I : I II I I I I I II I I I : I I I I : I : : I : II I : : I M I I : : I I I : I 
Db 590 PWSCGPPGALLTRPVVLTMHHCADPNTEDWKILLKNQAAQGQWEDVVWGEENFTTPCY 649 

Qy 592 CQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHD 651 

: 1:1 II:: II I : II I I : : II I I I I I I : I I : I : I I II : I I I I II I I I 
Db 650 IKLDAEACHILTENLSTYALVGHSTTKAAAKRLKLAI FGPLCCSSLEYSIRVYCLDDTQD 709 

Qy 652 ALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHI 711 

I I I I : : M : I I I I I : : II : I I II I II I I I I I I I : II I I II I I I II I I I I I : 
Db 710 ALKEILHLERQTGGQLLEEPKALHFKG 769 

Qy 712 WNGTQRYLHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINFNITKDTRFAELLALES 771 

I : I : I I I I I II I M I I : I : I I II I I II I : I I I : I : : :: : I I : 
Db 770 WSGSQRNLHCTFTLERFSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPTGIDLPLLDP 829 

Qy 772 EAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRG7VDWRTLAQKLHLDSHLSFFASKPSPT 831 

: : I II I I I I II II : II I I T II II I I I I I : II : I :: I I : I II I 
Db 830 ANTITTVTGPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLNYFATKSSPT 889 

Qy 832 AMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 872 

: I I : I I I I : : I I : I II I I I I : : I : : : : I : 
Db 890 GVILDLWEAQNFPDGNLSMLAAVLEEMGRHETWSLAAEGQ 930 



RESULT 8 
UN5B_XENLA 

ID UN5B_XENLA STANDARD; PRT; 943 AA.

AC Q8JGT4; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (UNC-5 homolog) (Protein XUNC-5).

OS Xenopus laevis (African clawed frog).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RX MEDLINE=22239703; PubMed=12351179; DOI=10.1016/S0925-4773(02)00215-0;

RA Anderson R.B., Holt C.E.; 

RT "Expression of UNC-5 in the developing Xenopus visual system."; 

RL Mech. Dev. 118:157-160(2002). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates

CC axon repulsion of neuronal growth cones in the developing nervous

CC system upon ligand binding (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: In the developing visual system, it is 

CC expressed within the developing optic vesicles and later becomes

CC restricted to the dorsal ciliary marginal zone, a site of 

CC retinoblast proliferation and differentiation. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity).

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 
CC -!- SIMILARITY: Contains 1 ZU5 domain.

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AY099459; AAM34486.1; 

DR HSSP; P07996; 1LSL. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003598; Ig_c2.

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5. 



DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 



DR SMART; SM00209; TSP1; 2. 



DR SMART; SM00218; ZU5; 1.
DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50092; TSP1; 2.




KW Developmental protein; Immunoglobulin domain; Phosphorylation;
KW Receptor; Repeat; Signal; Transmembrane.


FT   SIGNAL        1     30       Potential.
FT   CHAIN        31    943       Netrin receptor UNC5B.
FT   DOMAIN       31    380       Extracellular (Potential).
FT   TRANSMEM    381    401       Potential.
FT   DOMAIN      402    943       Cytoplasmic (Potential).
FT   DOMAIN       51    148       Ig-like.
FT   DOMAIN      150    245       Ig-like C2-type.
FT   DOMAIN      249    303       TSP type-1 1.
FT   DOMAIN      305    357       TSP type-1 2.
FT   DOMAIN      540    643       ZU5.
FT   DOMAIN      863    941       Death.
FT   DISULFID     72    131       By similarity.
FT   DISULFID    177    228       By similarity.
FT   CARBOHYD    225    225       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    350    350       N-linked (GlcNAc...) (Potential).


SQ SEQUENCE 943 AA; 105083 MW; A024E24A7EDB6175 CRC64;


Query Match 56.6%; Score 2639; DB 1; Length 943;



Best Local Similarity 53.8%; Pred. No. 7.4e-190; 

Matches 491; Conservative 160; Mismatches 221; Indels 40; Gaps 7; 

Qy 1 QQS ATVAN P VP GAN P D L L P H FLVE P EDVY I VKN K P VLLVC KAVP ATQ I FFKCN GEWVRQV 60 

: I : : I I : I I I I I : I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I 
Db 33 EYSD\a,PDSFPSAPAESLPHFLLEPEDAYIVKNKPVELVCKANPATQIYFKCNGEWVNQN 92 

Qy 61 DHVIERSTDGSSGLPTMEVl^INVSRQQVEKWGLEEYWCQCVAWSSSGTTKSQKAYIRIA 120 

II: : I : I I I I : I I I I I I I I : : I I I I : I I I I I I I I I I : I I I I I :: : I : I I I 
Db 93 DHITKERVl)DVTGLvVREVQIEVSRQQVEELFGLEDYWCQCVAWSSAGTTKSKRSYVRIA 152 

Qy 121 RLRKNFEQEPLAKEVSLEQGIVTiPCRPPEGIPPAEV^WLRNEDLVDPSLDPNVYITREHS 180 

I I II I : I I I I I I I : I I I : I I I I I II : II I I I I I I : I I : : : I I : J I II : I : 
Db 153 YLRKNFDQEPLGKEVALEQEALLQCRPPEGVPPAEVEWLKNEEIIDPTKDTNFLITIDHN 212 

Qy 181 LWRQARLADTANYTCVAKNIVARRRSASAAVIW 240 

I ::: I I I I : I I I.I I I I I : I I I I I : I I I : I I I I : I I I I I I : I I I I I I : II I I I I I 
Db 213 LIIKQARLSDTANYTCVSKNIVAKRRSTTATVIVFVNGGWSSWTEWSPCNNRCGHGWQKR 272 

Qy 241 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 300 

: I : I I I I I I I I I I I I I I II II I : I I I I I I : I I I I I I I : I I I I I I I I I : I 
Db 273 TRTCTNPAPLNGGTMCEGQQYQKFACNTMCPVDGGWTEWSKWSACSTECTHWRSRECNAP 332 

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHS AS GPEDVALYVGL- 1 AVAVCLV 348 

|:|||::| I ll::||| II: : I I I I I I I : I : : : : 

Db 333 TPKNGGKDCSGMLLDSKNCTDGLCMQNKRVLGETKSRLLESTGDVALYAGLWAIFIVII 392 

Qy 349 LLLLVLILVYCRKKEGLDSDVADSS-ILTSGFQPVSIKPSKADNPHLL — TIQPDLSTTT 405 

II: I 1:11 I 1:1: III II II J I : I I: II J: :|||.|: 

Db 393 LLMAVGIWYRRNCREFDTDITDSSAALTGGFHPVNFKTSRHDNSQLIHPAMQPDLTANA 452 



Qy 406 TTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSS PT 447



I : I : : I I : I : I I I I I I : : : : I I II 

Db 453 GIYRGNMYALQDS-ADKIPMTNSPLLDPLPNLKIKVYNSSTVGSSPGIHDGNNLLGTKPT 511. 

Qy 448 SEAEEFVSRLSTQN YFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAI 501 

: : : : I : : I I I : I I III I I I I I I I I I : II I I I II 

Db 512 GTYPSDNNIMNARNKNMSMQHLLTLPRDSSNSVTGTFGSLGGRLTFPNTGVSLLIPQGAI 571 

Qy 502 PRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDS 561 

I : I I I I : I I :: I I : II I I I : I I I I : : I I I I : I I : I I I I : I I : : 
Db 572 PQGKYYEMYLMINKRENTVLP S EGTQTI LS P I ITCGPTGLLLCKPVI LTVPHCADINTS D 631 

Qy 562 WSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 621 

I I : I I I I : I : I I : I : I I I : I I I I I : : I : : I I I : I I I I : I : I 
Db 632 WILQLKTQSHQGNWEEVVTLNEETLNTPCYCQLESHSCHTLLDQLGTYAFVGESYSRSAI 691 

Qy 622 KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYH 681 

I I I : I : I I I : I I I I I I I : : I I I : II I I I I I I : : I I I III I : : I I : : I I I I I I I 
Db 692 KRLQLAIFAPMLCTSLEYNLKVYCVEDTPDALK 751 

Qy 682 NLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWV 741 

I I I I I I I I : I 111:111: I I I I I I I I I I : I : I I I I I I I I I I I I : : : I I I : I 
Db 752 NLRLSIHDIPHSLWRSKLMAKYQEIPFYHIWSGSQRTLHCTFTLERYSLAATELTCKICV 811 

Qy 742 WQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 801 

I I I I : I I I : : : : : : : : : I I : I I I I I I I I I I I I I : I I I I 
Db 812 RQVEGEGQIFQLHTLLEENVKSFDPFCSQAENSVTTHLGPYAFKIPFSIRQKICNSLDAP 871 

Qy 802 CRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQP 861 

II III I I I I I : I : I : : I I : I III : I I : I I I I I : I : I : I I : I : : I : 

Db 872 NSRGNDWRLLAQKLCMDRYLNYFATKASPTGVILDLWElALHQDDGDLNTIiASALEEMGKS 931 

Qy 862 DAGLFTVS EAEC 873 

: I : : : I 
Db 932 EMMLVMATDGDC 943 
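
The percentages reported for each hit can be cross-checked from the counts printed beside them. The sketch below is a minimal Python check, not GenCore's own code. It assumes that Best Local Similarity is Matches divided by the total number of aligned columns (Matches + Conservative + Mismatches + Indels), and that Query Match is the hit score divided by the perfect score of 4660 given in the run header. The function names are hypothetical. Under these assumptions the figures printed for RESULT 8 (53.8% and 56.6%) are reproduced.

```python
def percent(part, whole):
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


def best_local_similarity(matches, conservative, mismatches, indels):
    # Assumption: the denominator is the number of alignment columns,
    # taken here as the sum of the four counts reported for the hit.
    columns = matches + conservative + mismatches + indels
    return percent(matches, columns)


def query_match(score, perfect_score):
    # Assumption: Query Match is the hit score relative to the score the
    # query would obtain when aligned against itself (the perfect score).
    return percent(score, perfect_score)


# Counts and score reported for RESULT 8 (UN5B_XENLA).
print(best_local_similarity(491, 160, 221, 40))  # 53.8
print(query_match(2639, 4660))                   # 56.6
```

The same two functions can be applied to the counts of any other hit in the listing to spot figures that were damaged in transcription.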



RESULT 9 
UN5B_MOUSE 

ID UN5B_MOUSE STANDARD; PRT; 945 AA. 

AC Q8K1S3; Q6PFH0; Q80Y85; Q9D398; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2).

GN Name=Unc5b; Synonyms=Unc5h2;

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY.

RX MEDLINE=22239710; PubMed=12351186; DOI=10.1016/S0925-4773(02)00248-4;

RA Engelkamp D.;

RT "Cloning of three mouse unc-5 genes and their expression patterns at

RT mid-gestation.";

RL Mech. Dev. 118:191-197(2002). 

RN [2] 



RP SEQUENCE FROM N.A. (ISOFORM 1). 

RC STRAIN=C57BL/6J; TISSUE=Medulla oblongata; 

RX MEDLINE=22354683; PubMed=12466851; DOI=10.1038/nature01266;

RA Okazaki Y., Furuno M., Kasukawa T., Adachi J., Bono H., Kondo S.,

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H.,

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C., Gojobori T.,

RA Baldarelli R., Hill D.P., Bult C., Hume D.A., Quackenbush J.,

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W.,

RA Blake J.A., Bradt D., Brusic V., Chothia C., Corbani L.E., Cousins S.,

RA Dalla E., Dragani T.A., Fletcher C.F., Forrest A., Frazer K.S.,

RA Gaasterland T., Gariboldi M., Gissi C., Godzik A., Gough J.,

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D.,

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L.,

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P.A.,

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H.,

RA Nagashima T., Numata K., Okido T., Pavan W.J., Pertea G., Pesole G.,

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S.,

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M.,

RA Sandelin A., Schneider C., Semple C.A., Setou M., Shimada K.,

RA Sultana R., Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M.,

RA Verardo R., Wagner L., Wahlestedt C., Wang Y., Watanabe Y., Wells C.,

RA Wilming L.G., Wynshaw-Boris A., Yanagisawa M., Yang I., Yang L.,

RA Yuan Z., Zavolan M., Zhu Y., Zimmer A., Carninci P., Hayatsu N.,

RA Hirozane-Kishikawa T., Konno H., Nakamura M., Sakazume N., Sato K.,

RA Shiraki T., Waki K., Kawai J., Aizawa K., Arakawa T., Fukuda S.,

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M., Kagawa I.,

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A.,

RA Yasunishi A., Yoshino M., Waterston R., Lander E.S., Rogers J.,

RA Birney E., Hayashizaki Y.;

RT "Analysis of the mouse transcriptome based on functional annotation of

RT 60,770 full-length cDNAs.";

RL Nature 420:563-573(2002).

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932; DOI=10.1073/pnas.242603899;

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G.,

RA Klausner R.D., Collins F.S., Wagner L., Shenmen C.M., Schuler G.D.,

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K.,

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F.,

RA Diatchenko L., Marusina K., Farmer A.A., Rubin G.M., Hong L.,

RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E.,

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C.,

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J.,

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A., Gunaratne P.H.,

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W.,

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A.,

RA Fahey J., Helton E., Ketteman M., Madan A., Rodrigues S., Sanchez A.,

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G.,

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C.,

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M.,

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E.,

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.;

RT "Generation and initial analysis of more than 15,000 full-length human

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [4] 



RP TISSUE SPECIFICITY. 

RX PubMed=12799072; 

RA Dalvin S., Anselmo M.A., Prodhan P., Komatsuzaki K., Schnitzer J.J.,

RA Kinane T.B.;

RT "Expression of Netrin-1 and its two receptors DCC and UNC5H2 in the 

RT developing mouse lung."; 

RL Gene Expr. Patterns 3:279-283(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand (By 

CC similarity).

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with GNAI2 via its cytoplasmic part (By similarity).

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=1;

CC IsoId=Q8K1S3-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q8K1S3-2; Sequence=VSP_011699;

CC -!- TISSUE SPECIFICITY: Highly expressed in brain. Expressed in lung 
CC during late development. Expressed during early blood vessel 

CC formation, in the semicircular canal and in a dorsal to ventral 

CC gradient in the retina. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
CC similarity).

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity).

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch).

CC 

DR EMBL; AJ487853; CAD32251.1; -. 

DR EMBL; AK018177; BAB31108.1; 

DR EMBL; BC048162; AAH48162.1; ALT_INIT.

DR EMBL; BC057560; AAH57560.1; -.

DR HSSP; P07996; 1LSL.

DR MGD; MGI:894703; Unc5b.

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 



DR InterPro; IPR003598; Ig_c2.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR InterPro; IPR000906; ZU5.
DR Pfam; PF00531; Death; 1.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF00090; TSP_1; 2.
DR Pfam; PF00791; ZU5; 1.
DR PRINTS; PR01705; TSP1REPEAT.
DR SMART; SM00005; DEATH; 1.
DR SMART; SM00408; IGc2; 1.
DR SMART; SM00209; TSP1; 2.
DR SMART; SM00218; ZU5; 1.
DR PROSITE; PS50017; DEATH_DOMAIN; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50092; TSP1; 2.




KW Alternative splicing; Apoptosis; Developmental protein;
KW Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal;
KW Transmembrane.






FT   SIGNAL        1     26       Potential.
FT   CHAIN        27    945       Netrin receptor UNC5B.
FT   DOMAIN       27    377       Extracellular (Potential).
FT   TRANSMEM    378    398       Potential.
FT   DOMAIN      399    945       Cytoplasmic (Potential).
FT   DOMAIN       48    145       Ig-like.
FT   DOMAIN      153    242       Ig-like C2-type.
FT   DOMAIN      246    300       TSP type-1 1.
FT   DOMAIN      302    354       TSP type-1 2.
FT   DOMAIN      541    644       ZU5.
FT   DOMAIN      865    943       Death.
FT   SITE        412    413       Cleavage (by caspase-3) (By similarity).
FT   SITE        707    725       Interaction with DCC (By similarity).
FT   DISULFID     69    128       By similarity.
FT   DISULFID    174    225       By similarity.
FT   CARBOHYD    222    222       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    347    347       N-linked (GlcNAc...) (Potential).
FT   VARSPLIC    356    367       NQRTLNDPKSHP -> T (in isoform 2).
FT                                /FTId=VSP_011699.
FT   CONFLICT    238    238       T -> A (in Ref. 2).
FT   CONFLICT    394    394       V -> E (in Ref. 2).
FT   CONFLICT    679    679       T -> S (in Ref. 2).
FT   CONFLICT    874    874       N -> D (in Ref. 2).


SQ SEQUENCE 945 AA; 103738 MW; 80E896F0F0E06012 CRC64;



Query Match 55.0%; Score 2563.5; DB 1; Length 945; 

Best Local Similarity 54.3%; Pred. No. 3.7e-184; 

Matches 494; Conservative 146; Mismatches 219; Indels 51; Gaps 12; 

Qy 11 PGANPDLLPHFLVTSPEDWI VT^NKPVTjLVGKAV^ 70 

I I : I I : I I : I I : I I I I I I I I I I I : I I I I I I : I I I I I I I I I III : I I 
Db 40 PSAPAEQLPYFLLEPQDAYIVKNKPVELHCRAFPATQIYFKCNGEWVSQNDHVTQESLDE 99 

Qy 71 SSGLPTMEWINVSRQQ\^KVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

:: I I II: I I I I I I I I :: I I I I : I I I I I III I I I I II I I .: : I I I I I I I I I I I .: I I I 
Db 100 ATGLRVREVQIEVSRQQVEELFGLEDYWCQCVAWSSSGTTKSRRAYIRIAYLRKNFDQEP 159 



Qy 131 LAKEVSLEQGIVLPCRPPEGIPPAEVIEWLRNEDLVDPSLDPNvTITREHSLWRQARLAD 190 



I I I I I I : : : I It I I I I : I I I I i I I : I I I : : I I : I I : I : I : I : : I I I I I : I 
160 LAKEVPLDHEVLLQCRPPEGVPVAEVEWLKNEDVIDPAQDTNFLLTI DHNLI IRQARLSD 219 



191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250 

I I I I I I I I I I I I I : I I I : I I I I I I I I I I I : I III II I I I I I I I I : I : I II I I I I 
220 TANYTCVAKNIVAKRRSTTATVIVYVNGGWSSWAEWSPCSNRCGRGWQKRTRTCTNPAPL 279 

251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

I I I I I I I I I I I I I I I : I I I I I : I : I I I I I I I : I I I I I I I I I hill : I 
280 NGGAFCEGQAFQKTACTTVCPVDGAWTEWSKWSACSTECAHWRSRECMAPPPQNGGRDCS 339 

311 GTDLDTRNCTSDLCV H S AS G P E DVAL YVGL - 1 AVAVCLVLL L LVL I LVY 358 

I I I I :: I I I III I I I I I I I I : I I I : : I : I : : I I 

340 GT LLD S KN CT DGLCVLNQ RT LN D P K SH P LET S GD VAL YAGLVVAVFVVVAVLMAVGVI VY 399 

359 CRKKEGLDSDVADSS-ILTSGFQPVSIKPSKADNPHLL — TIQPDLSTTTTTYQGSLCPR 415 

I I : I : I I I I I I I I I : I : : : I I I I : III:: I : I : 

400 RRNCRDFDTDITDSSAALTGGFHPVNFKTARPNNPQLLHPSAPPDLTASAGIYRGPVYAL 459 

416 QDGPSPKFQLTNGHLLSPLGGGRHTLHHSS PTSEAEEFVSR 456 

I I : I : I I I I I I : : : : I I I : : I I I 

460 QDS-ADKIPMTNSPLLDPLPSLKIKVYNSSTIGSGSGLADGADLLGVLPPGTYPGDF-SR 517 

457 LSTQNYFRS LPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKI 506 

: : I I I II I : III I I I I I : I I I : I I I : I I I I : I I 

518 DTHFLHLRSASLGSQHLLGLPRDPSSSVSGTFGCLGGRLSLPGTGVSLLVPNGAIPQGKF 577 

507 YEIYLTLHKPEDVRLPLA-GCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLR 565 

I : : I I : : I I I I I : I I I : I I I I : I I I I : I I I I I : I : I I I I : 

578 YDLYLHINKAEST-LPLSEGSQTVLSPSVTCGPTGLLLCRPWLTVPHCAEVIAGDWIFQ 636 

566 LKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLK 625 

I I I : : I I I : I : I I I : I I I I I I : I : : : I I I : : I I : I : I . I I I : 
637 LKTQAHQGHWEEWTLDEETLNTPCYCQLEAKSCHILLDQLGTYVFMGESYSRSAVKRLQ 696 

626 LLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRL 685 

I : I I I II I I I I : : I I I I I II I I I I I : : I I : III I : : I I : I I I I I I I I I I I 
697 LAIFAPALCTSLEYSLRVYCLEDTPVALKEVLELERTLGGYLVEEPKPLLFKDSYHNLRL 756 

686 SIHDWSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVE 745 

I : I I : I : I : I I I I II I I I I I I : I I I : I I I II I I I I I I I : :: : I I : I III 

757 SLHDI PHAHWRSKLLAKYQEI PFYHVWNGSQRALHCTFTLERHSLASTEFTCKVCVRQVE 816 

746 GDGQSFSINFNITKDTRFAELLALESEAG — VPALVGPSAFKIPFLIRQKIISSLDPPCR 803 

hill:: : : | I I I I I : : I I I I I I I I I I I I I I I I I 

817 GEGQIFQLHTTLA-ETPAGSLDALCS7VPGNAITTQLGPYAFKIPLSIRQKICSSLDAPNS 875 

804 RGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDA 863 

II III I I II I : I : h : I h I III : I h I I I I I : h h I I : h : h : 

876 RGNDWRLLAQKLSMDRYLN YFATKAS PTGVI LDLWEARQQDDGDLN S LASALEEMGKS EM 935 

864 GLFTVSEAEC 873 

: : : : I 
936 LVAMATDGDC 945 
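
Feature-table (FT) records such as those in the entry above follow a simple pattern: a feature key, start and end positions, and a free-text description, with continuation lines carrying text only. The following is a minimal Python sketch of how such records could be loaded and queried. It assumes whitespace-separated fields rather than the fixed column positions of the flat-file format, and the names `Feature` and `parse_features` are hypothetical, chosen for illustration.

```python
import re
from typing import NamedTuple


class Feature(NamedTuple):
    key: str
    start: int
    end: int
    description: str


# Assumption: fields are separated by whitespace (as in this transcript),
# not read from fixed character columns.
FT_LINE = re.compile(r"^FT\s+(\S+)\s+(\d+)\s+(\d+)\s*(.*)$")


def parse_features(lines):
    """Parse FT lines into Feature tuples, folding continuation lines."""
    features = []
    for line in lines:
        m = FT_LINE.match(line)
        if m:
            key, start, end, desc = m.groups()
            features.append(Feature(key, int(start), int(end), desc.strip()))
        elif line.startswith("FT") and features:
            # Continuation line: append its text to the previous feature.
            extra = line[2:].strip()
            last = features[-1]
            joined = (last.description + " " + extra).strip()
            features[-1] = last._replace(description=joined)
    return features


sample = [
    "FT DOMAIN 399 945 Cytoplasmic (Potential).",
    "FT SITE 412 413 Cleavage (by caspase-3) (By similarity).",
    "FT VARSPLIC 356 367 NQRTLNDPKSHP -> T (in isoform 2).",
    "FT /FTId=VSP_011699.",
]
features = parse_features(sample)
# Features whose range covers position 412 (the caspase-3 cleavage site).
covering = [f.key for f in features if f.start <= 412 <= f.end]
print(covering)  # ['DOMAIN', 'SITE']
```

A whitespace-based pattern is used here because the column layout did not survive transcription; a parser for well-formed flat files would normally slice fixed columns instead.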



RESULT 10
UN5B_RAT



ID UN5B_RAT STANDARD; PRT; 945 AA. 

AC O08722;

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2). 

GN Name=Unc5b; Synonyms=Unc5h2;

OS Rattus norvegicus (Rat).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A., FUNCTION, SUBCELLULAR LOCATION, AND TISSUE

RP SPECIFICITY.

RX MEDLINE=97271897; PubMed=9126742;

RA Leonardo E.D., Hinck L., Masu M., Keino-Masu K., Ackerman S.L.,

RA Tessier-Lavigne M.;

RT "Vertebrate homologues of C. elegans UNC-5 are candidate netrin 

RT receptors."; 

RL Nature 386:833-838(1997). 

RN [2] 

RP FUNCTION, AND INTERACTION WITH DCC. 

RX PubMed=10399920;

RA Hong K., Hinck L., Nishiyama M., Poo M.-M., Tessier-Lavigne M.,

RA Stein E.;

RT "A ligand-gated association between cytoplasmic domains of UNC5 and 

RT DCC family receptors converts netrin-induced growth cone attraction to 

RT repulsion."; 

RL Cell 97:927-941(1999). 

RN [3]

RP FUNCTION, AND MUTAGENESIS OF ASP-412.

RX PubMed=11387206; DOI=10.1093/emboj/20.11.2715;

RA Llambi F., Causeret F., Bloch-Gallego E., Mehlen P.;

RT "Netrin-1 acts as a survival factor via its receptors UNC5H and DCC";

RL EMBO J. 20:2715-2722(2001). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 

CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 

CC -!- SUBUNIT: Interacts with GNAI2 via its cytoplasmic part (By 

CC similarity). Interacts with the cytoplasmic part of DCC.

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: Mainly expressed in regions of differentiating 

CC neurons. Expressed in the developing sensory ganglia that flank 

CC the spinal cord from E12, peaking at E14. Expressed in the roof 

CC plate region of the spinal cord from E14. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity).

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 



CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.
CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain.
CC -!- SIMILARITY: Contains 2 TSP type-1 domains.
CC -!- SIMILARITY: Contains 1 ZU5 domain.
CC
CC This SWISS-PROT entry is copyright. It is produced through a collaboration
CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
CC the European Bioinformatics Institute. There are no restrictions on its
CC use by non-profit institutions as long as its content is in no way
CC modified and this statement is not removed. Usage by and for commercial
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
CC or send an email to license@isb-sib.ch).
CC
DR EMBL; U87306; AAB57679.1; -.
DR HSSP; P07996; 1LSL.
DR RGD; 621756; Unc5h2.
DR InterPro; IPR000488; Death.
DR InterPro; IPR011029; DEATH_like.
DR InterPro; IPR007110; Ig-like.
DR InterPro; IPR003598; Ig_c2.
DR InterPro; IPR000884; TSP1.
DR InterPro; IPR008085; TSP_1.
DR InterPro; IPR000906; ZU5.
DR Pfam; PF00531; Death; 1.
DR Pfam; PF00047; ig; 1.
DR Pfam; PF00090; TSP_1; 2.
DR Pfam; PF00791; ZU5; 1.
DR PRINTS; PR01705; TSP1REPEAT.



DR SMART; SM00005; DEATH; 1.
DR SMART; SM00408; IGc2; 1.
DR SMART; SM00209; TSP1; 2.
DR SMART; SM00218; ZU5; 1.
DR PROSITE; PS50017; DEATH_DOMAIN; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS50092; TSP1; 2.




KW Apoptosis; Developmental protein; Immunoglobulin domain;
KW Phosphorylation; Receptor; Repeat; Signal; Transmembrane.


FT   SIGNAL        1     26       Potential.
FT   CHAIN        27    945       Netrin receptor UNC5B.
FT   DOMAIN       27    377       Extracellular (Potential).
FT   TRANSMEM    378    398       Potential.
FT   DOMAIN      399    945       Cytoplasmic (Potential).
FT   DOMAIN       48    145       Ig-like.
FT   DOMAIN      153    242       Ig-like C2-type.
FT   DOMAIN      246    300       TSP type-1 1.
FT   DOMAIN      302    354       TSP type-1 2.
FT   DOMAIN      541    644       ZU5.
FT   DOMAIN      865    943       Death.
FT   SITE        412    413       Cleavage (by caspase-3).
FT   SITE        707    725       Interaction with DCC.
FT   DISULFID     69    128       By similarity.
FT   DISULFID    174    225       By similarity.
FT   CARBOHYD    222    222       N-linked (GlcNAc...) (Potential).
FT   CARBOHYD    347    347       N-linked (GlcNAc...) (Potential).
FT   MUTAGEN     412    412       D->N: Abolishes cleavage by caspase-3 and
FT                                subsequent induction of apoptosis.


SQ SEQUENCE 945 AA; 103520 MW; 6E9C2A262E560B9B CRC64;



Query Match 54.9%; Score 2560.5; DB 1; Length 945;

Best Local Similarity 54.0%; Pred. No. 6.2e-184; 

Matches 496; Conservative 139; Mismatches 215; Indels 69; Gaps 14; 



Qy 11 PGANPDLLPHFLVEPEDWIVT^KPVXLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDG 70 

II : I I I I I : I I I I I I I I I I I I I I : I I I I I I : I I I I I I I I I I I : I I 

Db 40 PSAPAEQLPHFLLEPEDAYIVKNKPVELHCRAFPATQIYFKCNGEWVSQKGHVTQESLDE 99 

Qy 71 SSGLPTMEVl^IWSRQQv^KVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

::M 11:1 I I I I I I I :: I I I I : I I I I I I I I I I I I I I I I :: I I I I I I llllhlll 

Db 100 ATGLRIREVQIEVSRQQVEELFGLEDYWCQCVAWSSSGTTKSRRAYIRIAYLRKNFDQEP 159 

Qy 131 LAKEVSLEQGIVXPCRPPEGIPPAEVTCWLRNEDLVT)PSLDPW 190 

I I I I I I : : : I I I I I I I : I I I I I I I : I I I : : I I : I I : I : I : I : : I I I I I : I 

Db 160 LAKEVPLDHEVLLQCRPPEGVPVAEVEWLKNEDVIDPAQDTNFLLTIDHNLIIRQARLSD 219 

Qy 191 TANYTCVAKNIVARRRSASAAVIVTWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPL 250 

I I I I I I I I I I I I I : I I I : I I I I I I I I I I I : I III II I I I I I I I I : I : I I I I I I I 

Db 220 TT^YTCVAKNIVAKRRSTTATVIVTWGGWSSWAEWSPCSNRCGRGWQKRTRTCTNPAPL 279 

Qy 251 NGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQ 310 

I I I I I I I I I I I I I I I : I I I I I : I : I I I I I I I : I - II I I I I I I hill : I 

Db 280 NGGAFCEGQACQKTACTTVCPVDGAWTEWSKWSACSTECAHWRSRECMAPPPQNGGRDCS 339 

Qy 311 GTDLDTRNCTSDLCV HSASGPE DVAL YVGL- 1 AVAVC LVL L L LVL I LVY 358 

I I I I :: I I I III : : I : I I I I I I I : I I I I : I : I :: I I 

Db 34 0 GT LL D S KN CT D GLC VLNQ RT LNDPKSRPLEPS GDVAL YAGL VVAVFWLAVXMAVGVT VY 399 

Qy 359 CRKKEGLDSDVADSS-ILTSGFQPVSIKPSKADNPHLL — TIQPDLSTTTTTYQGSLCPR 415 

I I : I : I I I 111111:1:: I I I I : III:: I : I : 

Db 400 RRNCRDFDTDITDSSAALTGGFHPVNFKTARPSNPQLLHPSAPPDLTASAGIYRGPVYAL 459 

Qy 416 QDGPSPKFQLTNGHLLSPL GGG 437 

I I : I : II I I I I II 

Db 460 QDS-ADKIPMTNSPLLDPLPSLKIKVYDSSTIGSGAGLADGADLLGVLPPGTYPGDFSRD 518 

Qy 438 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 497 

I I I I : I : I : III I : III I I I I I I I I I : I I I : I 

Db 519 THFLHLRS ASLGSQ-HLLGLPRDPSSSVSGTFGCLGGRLTIPGTGVSLLVP 568 

Qy 498 PDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGE 556 

111:11 I : : I I : : I I III: I I I : I I I hill I : I I I I h I : II I 

Db 569 NGAIPQGKFYDLYLRINKTEST-LPLSEGSQTVXSPSVTCGPTGLLLCRPVVLTVPHCAE 627 

Qy 557 PSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEAL 616 

I : I I I : : I I h h I II : I I I I I I : I :: : I I I : I I : 

Db 628 VIAGDWI FQLKTQAHQGHWEEWTLDEETLNTPCYCQLEAKSCHI LLDQLGT YVFTGES Y 687 

Qy 617 SVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHF 67 6 

I : I I I I : I : I I I I I I I I I :: I I I I I I I lllll::||: III hUh I I 

Db 688 SRSAVl^RLQLAIFAPALCTSLEYSLRWCLEDTPAALKEvliELERTLGGYLVEEPKTLLF 747 

Qy 677 KDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLA 736 

I I I I I I I I I I : I I: I : h I I I I I I I I I I I h I I h I : I I I I I I I I I I : : : : 

Db 748 KDSYHNLRLSLHDI PHAHWRSKLIAKYQEI PFYHVWNGSQKALHCTFTLERHSLASTEFT 807 



Qy 737 CKLWVWQVEGDGQS FS INFNITKDTRFAELLALESEAGVPAL — VGPSAFKI PFLI RQKI 794 

I I : I I I I I : I I I : : : ■: I till I I : I I I I I I I Mill 
Db 808 CKVCVRQVEGEGQIFQLHTTLA-ETPAGSLDALCSAPGNAATTQLGPYAFKIPLSIRQKI 866 

Qy 795 ISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAA 854 

: I I I I I I I I I I I I I I : I : I :: I I : I I I I : I I : I I I I I : I : I : I I : I 
Db 867 CNSLDAPNSRGNDWRLLAQKLSMDRYLNYFATKASPTGVILDLWEARQQDDGDLNSLASA 926 

Qy 855 VAGLGQ P DAGL FTVS EAEC 873 

• • I • * ■ 

• • I • • • • • * | 

Db 927 LEEMGKSEMLVAMTTDGDC 945 



RESULT 11 
UN5B_HUMAN 

ID UN5B_HUMAN STANDARD; PRT; 945 AA. 

AC Q8IZJ1; Q86SN3; Q8N1Y2; Q9H9F3; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2) 

DE (p53-regulated receptor for death and life protein 1) 

DE (UNQ1883/PRO4326).

GN Name=UNC5B; Synonyms=P53RDL1, UNC5H2;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 2), TISSUE SPECIFICITY, AND INTERACTION 

RP WITH GNAI2. 

RC TISSUE=Lung; 

RX MEDLINE=22246081; PubMed=12359238; DOI=10.1016/S0006-291X(02)02277-5;

RA Komatsuzaki K., Dalvin S., Kinane T.B.;

RT "Modulation of G(ialpha(2)) signaling by the axonal guidance molecule

RT UNC5H2.";

RL Biochem. Biophys. Res. Commun. 297:898-905(2002). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1), FUNCTION, AND MUTAGENESIS OF ASP-412. 

RX PubMed=12598906; DOI=10.1038/ncb943;

RA Tanikawa C., Matsuda K., Fukuda S., Nakamura Y., Arakawa H.;

RT "p53RDL1 regulates p53-dependent apoptosis.";

RL Nat. Cell Biol. 5:216-223(2003). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RX MEDLINE=22887296; PubMed=12975309; DOI=10.1101/gr.1293003;

RA Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D., Brush J.,

RA Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P.,

RA Eaton D., Foster J., Grimaldi C., Gu Q., Hass P.E., Heldens S.,

RA Huang A., Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J.,

RA Lewis L., Liao D., Mark M., Robbie E., Sanchez C., Schoenfeld J.,

RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A.,

RA Vandlen R., Watanabe C., Wieand D., Woods K., Xie M.-H., Yansura D.,

RA Yi S., Yu G., Yuan J., Zhang M., Zhang Z., Goddard A., Wood W.I.,

RA Godowski P., Gray A.;

RT "The secreted protein discovery initiative (SPDI), a large-scale

RT effort to identify novel human secreted and transmembrane proteins: a

RT bioinformatics assessment.";

RL Genome Res. 13:2265-2270(2003).

RN [4] 

RP SEQUENCE OF 361-945 FROM N.A. 

RC TISSUE=Amygdala, and Teratocarcinoma; 

RX PubMed=14702039; DOI=10.1038/ng1285;

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R.,

RA Wakamatsu A., Hayashi K., Sato H., Nagai K., Kimura K., Makita H.,

RA Sekine M., Obayashi M., Nishi T., Shibahara T., Tanaka T., Ishii S.,

RA Yamamoto J.-I., Saito K., Kawai Y., Isono Y., Nakamura Y.,

RA Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M.,

RA Shiratori A., Sudo H., Hosoiri T., Kaku Y., Kodaira H., Kondo H.,

RA Sugawara M., Takahashi M., Kanda K., Yokoi T., Furuya T., Kikkawa E.,

RA Omura Y., Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M.,

RA Yamazaki M., Ninomiya K., Ishibashi T., Yamashita H., Murakawa K.,

RA Fujimori K., Tanai H., Kimata M., Watanabe M., Hiraoka S., Chiba Y.,

RA Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T.,

RA Kusano J., Kanehori K., Takahashi-Fujii A., Hara H., Tanase T.-O.,

RA Nomura Y., Togiya S., Komai F., Hara R., Takeuchi K., Arita M.,

RA Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S.,

RA Yoshikawa Y., Matsunawa H., Ichihara T., Shiohata N., Sano S.,

RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O.,

RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H.,

RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M., Kawakami B.,

RA Yamazaki M., Watanabe K., Kumagai A., Itakura S., Fukuzumi Y.,

RA Fujimori Y., Komiyama M., Tashiro H., Tanigami A., Fujiwara T.,

RA Ono T., Yamada K., Fujii Y., Ozaki K., Hirao M., Ohmori Y.,

RA Kawabata A., Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S.,

RA Okitani R., Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T.,

RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M., Sasaki M.,

RA Togashi T., Oyama M., Hata H., Watanabe M., Komatsu T.,

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y., Nakagawa K.,

RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R.,

RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.;

RT "Complete sequencing and characterization of 21,243 full-length human

RT cDNAs.";

RL Nat. Genet. 36:40-45(2004). 

RN [5] 

RP DOWN-REGULATION IN CANCER.

RX PubMed=12655055; DOI=10.1073/pnas.0738063100;

RA Thiebault K., Mazelin L., Pays L., Llambi F., Joly M.-O.,

RA Scoazec J.-Y., Saurin J.-C., Romeo G., Mehlen P.;

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 
CC similarity). Interacts with GNAI2 via its cytoplasmic part. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity).

CC -!- ALTERNATIVE PRODUCTS:

CC Event=Alternative splicing; Named isoforms=2;

CC Name=1;

CC IsoId=Q8IZJ1-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q8IZJ1-2; Sequence=VSP_011698;

CC -!- TISSUE SPECIFICITY: Highly expressed in brain. Also expressed at 
CC lower level in developing lung, cartilage, kidney and 

CC hematopoietic and immune tissues. 

CC -!- INDUCTION: By p53/TP53. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis. 

CC -!- MISCELLANEOUS: Down-regulated in multiple cancers including

CC colorectal, breast, ovary, uterus, stomach, lung, or kidney 

CC cancers. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

CC

DR EMBL; AY126437; AAM95701.1; -. 

DR EMBL; AB096256; BAC57998.1; -.

DR EMBL; AY358351; AAQ88717.1; -. 

DR EMBL; AK022859; BAB14276.1; ALT_INIT. 

DR EMBL; AK094595; BAC04382.1; ALT_INIT. 

DR HSSP; P07996; 1LSL. 

DR Genew; HGNC:12568; UNC5B.

DR MIM; 607870; -. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2.

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG. 

DR PROSITE; PS50835; IG_LIKE; 1.



DR PROSITE; PS50092; TSP1; 2. 

KW Alternative splicing; Apoptosis; Developmental protein; 

KW Immunoglobulin domain; Phosphorylation; Polymorphism; Receptor; 

KW Repeat; Signal; Transmembrane. 



FT SIGNAL 1 26 Potential.
FT CHAIN 27 945 Netrin receptor UNC5B.
FT DOMAIN 27 377 Extracellular (Potential).
FT TRANSMEM 378 398 Potential.
FT DOMAIN 399 945 Cytoplasmic (Potential).
FT DOMAIN 48 145 Ig-like.
FT DOMAIN 147 242 Ig-like C2-type.
FT DOMAIN 246 300 TSP type-1 1.
FT DOMAIN 302 354 TSP type-1 2.
FT DOMAIN 541 644 ZU5.
FT DOMAIN 865 943 Death.
FT SITE 412 413 Cleavage (by caspase-3).
FT SITE 707 725 Interaction with DCC (By similarity).
FT DISULFID 69 128 By similarity.
FT DISULFID 174 225 By similarity.
FT CARBOHYD 222 222 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 347 347 N-linked (GlcNAc...) (Potential).
FT VARSPLIC 356 367 NKKTLSDPNSHL -> M (in isoform 2).
FT /FTId=VSP_011698.
FT VARIANT 516 516 A -> T (in dbSNP:10509332).
FT /FTId=VAR_019730.
FT MUTAGEN 412 412 D->N: Abolishes cleavage by caspase-3 and
FT subsequent induction of apoptosis.
FT CONFLICT 483 483 K -> E (in Ref. 3).
FT CONFLICT 851 851 L -> P (in Ref. 3; BAB14276).
SQ SEQUENCE 945 AA; 103637 MW; 56064E335F323447 CRC64;



Query Match 54.5%; Score 2541.5; DB 1; Length 945; 

Best Local Similarity 53.4%; Pred. No. 1.7e-182; 
Matches 491; Conservative 146; Mismatches 229; Indels 53; Gaps 13;



Qy 3 SATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDH 62

I : : II : I I : I I I I : I I I I I I I I I I 1 : I I I I I I : I II I I I I I I I I 
Db 32 SEVLPDSFPSAPAEPLPYFLQEPQDAYIVKNKPVELRCRAFPATQIYFKCNGEWVSQNDH 91



Qy 63 VIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARL 122

I : I : : I I I I : I I I I I I I I : : I I I I : I I I I I I I I I I : I I I I I : : I I : I I I I 
Db 92 VTQEGLDEATGLRVREVQIEVSRQQVEELFGLEDYWCQCVAWSSAGTTKSRRAYVRIAYL 151



Qy 123 RKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHSLV 182

1111:1111 III I : : : I I I I I I I : I I I I I I I : I I I : : I I : I I : I : I : I : 
Db 152 RKNFDQEPLGKEVPLDHEVLLQCRPPEGVPVAEVEWLKNEDVIDPTQDTNFLLTIDHNLI 211

Qy 183 VRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSR 242

: I I I I I : I I I I I I I I I I I I I I : I I I : I I II I I I I I I I : I III II I I I I I I I I : I 
Db 212 IRQARLSDTANYTCVAKNIVAKRRSTTATVIVYVNGGWSSWAEWSPCSNRCGRGWQKRTR 271

Qy 243 SCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAP 302

: I I I I I I II I I I I I I I I I I I I I I : I I I I I : I : I I I I I II : I I II I I I I I I 
Db 272 TCTNPAPLNGGAFCEGQAFQKTACTTICPVDGAWTEWSKWSACSTECAHWRSRECMAPPP 331

Qy 303 RNGGEECQGTDLDTRNCTSDLCVHS- 
: I I I : I I I I I : : II I I I : : 

-AS G P E DVAL YVGL - 1 AVAVC LV 348 
III I I I I I I : I : I : 

Db 332 QNGGRDCSGTLLDSKNCTDGLCMQNKKTLSDPNSHLLEASG — DAAL YAGLWAI FVWA 389



Qy 349 LLLLVLILVYCRKKEGLDSDVADSS-ILTSGFQPVSIKPSKADNPHLL — TIQPDLSTTT 405 

:|: I ::|| I |:|: III II II I I : I : : II II :: |||: : 

Db 390 ILMAVGVWYRRNCRDFDTDITDSSAALTGGFHPVNFKTARPSNPQLLHPSVPPDLTASA 449 

Qy 406 TTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPT SEAEEFVSRLST 459 

I : I : II : I : I I I I I I : : : I I I : : : : I 

Db 450 GIYRGPVYALQDS-TDKIPMTNSPLLDPLPSLKVKVYSSSTTGSGPGLADGADLLGVLPP 508 

Qy 460 QNY FRS LPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 497 

I II III : I I I I I I I I I I I I : I I I : I 

Db 509 GTYPSDFARDTHFLHLRSASLGSQQLLGLPRDPGSSVSGTFGCLGGRLSIPGTGVSLLVP 568 

Qy 498 PDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTLLSPIVSCGPPGVLLTRPVILAMDHCGE 556

111:11 11:11 ::| I III: I MUM hill I : I I Mill I II I 
Db 569 NGAIPQGKFYEMYLLINKAEST-LPLSEGTQTVLSPSVTCGPTGLLLCRPVTLTMPHCAE 627 

Qy 557 PSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEAL 616 

I I : I I I : : I I I : I : I I I : I I I II I I : : : I I I : I I : 
Db 628 VSARDWIFQLKTQAHQGHWEEWTLDEETLNTPCYCQLEPRACHILLDQLGTYVFTGESY 687 

Qy 617 SVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLEKQLGGQLIQEPRVLHF 676

I : I I I I : I : I I I I I I I I I : : I I I I I II I I I I I : : I I : III I : : I I : I I 
Db 688 SRSAVKRLQLAVFAPALCTSLEYSLRVYCLEDTPVALKEVLELERTLGGYLVEEPKPLMF 747 

Qy 677 KDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLA 736

I I I I I I I I I I : I I : I : I : I I I I I I I I I I I I I I : I : I : I I I I I I I I I I :: : : I 
Db 748 KDSYHNLRLSLHDLPHAHWRSKLLAKYQEIPFYHIWSGSQKALHCTFTLERHSLASTELT 807 

Qy 737 CKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAG--VPALVGPSAFKIPFLIRQKI 794

I I : I I I I I : I I I : : : : I I II I I : I I I I I I I I I I I I 

Db 808 CKICVRQVEGEGQIFQLHTTLA-ETPAGSLDTLCSAPGSTVTTQLGPYAFKIPLSIRQKI 866 

Qy 795 ISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAA 854 

: I I I I II I I I I I I I I : I : I : : I I : I I I I : I I : II I I : I : I : I I : I 
Db 867 CNSLDAPNSRGNDWRMLAQKLSMDRYLNYFATKASPTGVILDLWEALQQDDGDLNSLASA 926 

Qy 855 VAGLGQPDAGLFTVSEAEC 873 

• r|it i i x i \ 
Db 927 LEEMGKSEMLVAVATDGDC 945 



RESULT 12 
UN5D_HUMAN 

ID UN5D_HUMAN STANDARD; PRT; 953 AA. 

AC Q6UXZ4; Q8WYP7;

DT 25-OCT-2004 (Rel. 45, Created)

DT 25-OCT-2004 (Rel. 45, Last sequence update)

DT 25-OCT-2004 (Rel. 45, Last annotation update)

DE Netrin receptor UNC5D precursor (Unc-5 homolog D) (Unc-5 homolog 4)
DE (UNQ6012/PRO34692).

GN Name=UNC5D; Synonyms=KIAA1777, UNC5H4;

OS Homo sapiens (Human).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.

OX NCBI_TaxID=9606;



RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1).

RX MEDLINE=22887296; PubMed=12975309; DOI=10.1101/gr.1293003;

RA Clark H.F., Gurney A.L., Abaya E., Baker K., Baldwin D., Brush J.,

RA Chen J., Chow B., Chui C., Crowley C., Currell B., Deuel B., Dowd P.,

RA Eaton D., Foster J., Grimaldi C., Gu Q., Hass P.E., Heldens S.,

RA Huang A., Kim H.S., Klimowski L., Jin Y., Johnson S., Lee J.,

RA Lewis L., Liao D., Mark M., Robbie E., Sanchez C., Schoenfeld J.,

RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A.,

RA Vandlen R., Watanabe C., Wieand D., Woods K., Xie M.-H., Yansura D.,

RA Yi S., Yu G., Yuan J., Zhang M., Zhang Z., Goddard A., Wood W.I.,

RA Godowski P., Gray A.;

RT "The secreted protein discovery initiative (SPDI), a large-scale

RT effort to identify novel human secreted and transmembrane proteins: a

RT bioinformatics assessment.";

RL Genome Res. 13:2265-2270(2003). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Brain; 

RA Nakajima D., Nakayama M., Nagase T., Ohara O.;

RT "Identification of unc5H4 gene.";

RL Submitted (JAN-2001) to the EMBL/GenBank/DDBJ databases.

CC -!- FUNCTION: Receptor for netrin. May be involved in axon guidance by 

CC mediating axon repulsion of neuronal growth cones in the 

CC developing nervous system upon ligand binding. Axon repulsion in 

CC growth cones may be caused by its association with DCC that may 

CC trigger signaling for repulsion. It also acts as a dependence 

CC receptor required for apoptosis induction when not associated with 

CC netrin ligand (By similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=1;

CC IsoId=Q6UXZ4-1; Sequence=Displayed;

CC Name=2;

CC IsoId=Q6UXZ4-2; Sequence=VSP_011703; 

CC Note=No experimental confirmation available; 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity).

CC -!- SIMILARITY: Belongs to the UNC-5 family.

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 
CC -!- SIMILARITY: Contains 1 ZU5 domain.



CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).

DR EMBL; AB055056; BAB83663.1; -.

DR EMBL; AY358147; AAQ88514.1;

DR HSSP; P07996; 1LSL.

DR Genew; HGNC:18634; UNC5D.

DR InterPro; IPR000488; Death.

DR InterPro; IPR011029; DEATH_like.

DR InterPro; IPR007110; Ig-like.

DR InterPro; IPR003598; Ig_c2.

DR InterPro; IPR000884; TSP1.

DR InterPro; IPR008085; TSP_1.

DR InterPro; IPR000906; ZU5.

DR Pfam; PF00531; Death; 1.

DR Pfam; PF00047; ig; 1.

DR Pfam; PF00090; TSP_1; 2.

DR Pfam; PF00791; ZU5; 1.

DR PRINTS; PR01705; TSP1REPEAT.

DR SMART; SM00005; DEATH; 1.

DR SMART; SM00408; IGc2; 1.

DR SMART; SM00209; TSP1; 2.

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.

DR PROSITE; PS50835; IG_LIKE; 1.

DR PROSITE; PS50092; TSP1; 2.

KW Alternative splicing; Apoptosis; Developmental protein;

KW Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal;

KW Transmembrane.




FT SIGNAL 1 32 Potential.
FT CHAIN 33 953 Netrin receptor UNC5D.
FT DOMAIN 33 379 Extracellular (Potential).
FT TRANSMEM 380 400 Potential.
FT DOMAIN 401 953 Cytoplasmic (Potential).
FT DOMAIN 54 151 Ig-like.
FT DOMAIN 153 244 Ig-like C2-type.
FT DOMAIN 252 306 TSP type-1 1.
FT DOMAIN 308 360 TSP type-1 2.
FT DOMAIN 540 642 ZU5.
FT DOMAIN 859 936 Death.
FT SITE 416 417 Cleavage (by caspase-3) (By similarity).
FT SITE 703 721 Interaction with DCC (By similarity).
FT DISULFID 75 134 By similarity.
FT DISULFID 180 231 By similarity.
FT CARBOHYD 117 117 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 228 228 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 353 353 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 376 376 N-linked (GlcNAc...) (Potential).
FT VARSPLIC 1 34 MGRAAATAGGGGGARRWLPWLGLCFWAAGTAAAR -> MIL
FT VLVKALSDVCAGTSGFLLDFSSQTSP (in isoform
FT 2).
FT /FTId=VSP_011703.
SQ SEQUENCE 953 AA; 105879 MW; 5F893B9DF746F731 CRC64;



Query Match 46.9%; Score 2185.5; DB 1; Length 953; 

Best Local Similarity 46.3%; Pred. No. 1.2e-155; 

Matches 421; Conservative 159; Mismatches 268; Indels 61; Gaps 12; 



Qy 10 VPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTD 69

: I I I MM: I I : I I I : I : I : I III II I I I I I I I I I I I I : I I I : I 
Db 46 IPSA-PGTLPHFIEEPDDAYIIKSNPIALRCKARPAMQIFFKCNGEWVHQNEHVSEETLD 104

Qy 70 GSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQE 129

I I I I II 111:11111 I I : I I I I I I I I I I I : I I : I I : I I I I I I I I II : 
Db 105 ESSGLKVREVFINVTRQQVEDFHGPEDYWCQCVAWSHLGTSKSRKASVRIAYLRKNFEQD 164

Qy 130 PLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHSLVVRQARLA 189

I :M :| I I I 1 I I I I I : I I I I I I I : I I :: I I I : :|:|::|||||: 
Db 165 PQGREVPIEGMIVLHCRPPEGVPAAEVEWLKNEEPIDSEQDENIDTRADHNLIIRQARLS 224

Qy 190 DTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAP 249

I : I I I I : I lllhlll II I : I I I I I I I I : I I I I I I : I I I I I I I I I I : I I I I I I 
Db 225 DSGNYTCMAANIVAKRRSLSATWVYWGGWSSOT 284

Qy 250 LNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEEC 309

111111111:111 I : I I II I I I I Ihlll : I I I I I I : I I I I I I : I 
Db 285 LNGGAFCEGMSVQKITCTSLCPVDGSWEVWSEWSVCSPECEHLRIRECTAPPPRNGGKFC 344

Qy 310 QGTDLDTRNCT S DLCV HSASGPEDVALYVGLIAVAVCLVLLLLVLI LVY 358

: I : : I I I I I : I I : I I I I I II I : I : : : : I 

Db 345 EGLSQESENCTDGLCILDKKPLHEIKPQSIENASDIALYSGL-GAAWAVAVLVIGVTLY 403

Qy 359 CRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL--TIQPDLSTTTTTYQGSLCPRQ 416

I : I I I I I I I I I I : I : I II : I I I I I : I I I : I I 

Db 404 RRSQSDYGVDVIDSSALTGGFQTFNFKTVRQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQ 461

Qy 417 DGPSPKFQLTNGHLLSPLG GGRHTLHH 443

I I I : I I : I I II: 
Db 462 D-PLDKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFST 520

Qy 444 SSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPR 503

I : : : : II III I III I I I I I : : I I I I : I I I I I III 

Db 521 MHPRNKM-PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPE 573

Qy 504 GKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWS 563

: II I : : : : : I I I : I I I I I : I I I I : : : I I I : I I : I : I : 
Db 574 ENSWEIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWN 632

Qy 564 LRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKR 623

: I I I : : : I I I : I : : : I : I III: I I : I : I : I I I I : : I I : 
Db 633 IHLKKRTQQGKWEEVMSVEDESTS--CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQ 690

Qy 624 LKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLEKQLGGQLIQEPRVLHFKDSYHNL 683

II: : I :-:l I I : I I : I I I I : : I I : I I I I : I I I I : : I I : : I I I I : : I ~ 
Db 691 LKVAVFGCMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSL 750

Qy 684 RLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQ 743

: : I : I : I II: I : I I : I I : I : : I I I I : I I I : I : I : I : I I : : I 
Db 751 QISVLDIPPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQ 810

Qy 744 VEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCR 803

::| I : :| : I : .:. : II II Mill: III: I :: I I 

Db 811 LKGHEQILQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNA 870

Qy 804 RGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDA 863



: I II: MM :: :||:||:: I I : I : I I I I I I I I I :|:| III: :|: 
Db 871 KGKDWQMLAQKNSINRNLSYFATQSSPSAVTLNLWEARHQHDGDLDSLACALEEIGRTHT 930 



Qy 864 GLFTVSEAE 872 

I :||:: 

Db 931 KLSNISESQ 939 

RESULT 13 
UN5D_MOUSE 

ID UN5D_MOUSE STANDARD; PRT; 956 AA. 

AC Q8K1S2; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5D precursor (Unc-5 homolog D) (Unc-5 homolog 4).

GN Name=Unc5d; Synonyms=Unc5h4;

OS Mus musculus (Mouse).

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY.

RX MEDLINE=22239710; PubMed=12351186; DOI=10.1016/S0925-4773(02)00248-4;

RA Engelkamp D.; 

RT "Cloning of three mouse Unc5 genes and their expression patterns at 

RT mid-gestation."; 

RL Mech. Dev. 118:191-197(2002).

CC -!- FUNCTION: Receptor for netrin involved in cell migration. May be 

CC involved in axon guidance by mediating axon repulsion of neuronal 

CC growth cones in the developing nervous system upon ligand binding. 

CC Axon repulsion in growth cones may be caused by its association 

CC with DCC that may trigger signaling for repulsion. It also acts as 

CC a dependence receptor required for apoptosis induction when not 

CC associated with netrin ligand (By similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Expressed in developing limb and mammary 

CC gland. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity).

CC -!- SIMILARITY: Belongs to the UNC-5 family.

CC -!- SIMILARITY: Contains 1 death domain.

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain.

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain.

CC -!- SIMILARITY: Contains 2 TSP type-1 domains.

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation -

CC the European Bioinformatics Institute. There are no restrictions on its

CC use by non-profit institutions as long as its content is in no way

CC modified and this statement is not removed. Usage by and for commercial

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/

CC or send an email to license@isb-sib.ch).



DR EMBL; AJ487854; CAD32252.1; -. 

DR HSSP; P07996; 1LSL. 

DR MGD; MGI:2389364; Unc5d.

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5.

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2.

DR Pfam; PF00791; ZU5; 1.

DR PRINTS; PR01705; TSP1REPEAT.

DR SMART; SM00005; DEATH; 1.

DR SMART; SM00408; IGc2; 1.

DR SMART; SM00209; TSP1; 2.

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG.

DR PROSITE; PS50835; IG_LIKE; 1.

DR PROSITE; PS50092; TSP1; 2.

KW Apoptosis; Developmental protein; Immunoglobulin domain;

KW Phosphorylation; Receptor; Repeat; Signal; Transmembrane.


FT SIGNAL 1 30 Potential.
FT CHAIN 31 956 Netrin receptor UNC5D.
FT DOMAIN 31 382 Extracellular (Potential).
FT TRANSMEM 383 403 Potential.
FT DOMAIN 404 956 Cytoplasmic (Potential).
FT DOMAIN 52 149 Ig-like.
FT DOMAIN 151 242 Ig-like C2-type.
FT DOMAIN 250 304 TSP type-1 1.
FT DOMAIN 306 358 TSP type-1 2.
FT DOMAIN 543 645 ZU5.
FT DOMAIN 862 939 Death.
FT SITE 419 420 Cleavage (by caspase-3) (By similarity).
FT SITE 706 724 Interaction with DCC (By similarity).
FT DISULFID 73 132 By similarity.
FT DISULFID 178 229 By similarity.
FT CARBOHYD 115 115 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 226 226 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 351 351 N-linked (GlcNAc...) (Potential).
FT CARBOHYD 379 379 N-linked (GlcNAc...) (Potential).
SQ SEQUENCE 956 AA; 106351 MW; DFDF07839C10C68D CRC64;



Query Match 46.8%; Score 2183; DB 1; Length 956; 

Best Local Similarity 45.9%; Pred. No. 1.8e-155; 

Matches 424; Conservative 157; Mismatches 271; Indels 72; Gaps 13; 

Qy 3 SATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDH 62

I : : : I I I II II : I I I I I I : I : I : I III II I I I II I I I II I I : 1 
Db 37 SEILPDSIPSA-PGTLPHFIEEPEDAYIIKSNPIALRCKARPAMQIFFKCNGEWVHQNEH 95



Qy 63 VIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARL 122

I I I I I I I I I I I I I : I I I I I 11:111111111 I I : I I : I I : I I I I 
Db 96 VSEESLDESSGLKVREVFINVTRQQVEDFHGPEDYWCQCVAWSHLGTSKSRKASVRIAYL 155



Qy 123 RKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVYITREHSLV 182 

I I II I I : I : I I : I I I I I I I I II : I I I I I I I : I I : : I I I : : I : I : 
Db 156 RKNFEQDPQGREVPIEGMIVLHCRPPEGVPAAEVEWLKNEEPIDSEQDENIDTRADHNLI 215 

Qy 183 VRQARLADTANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSR 242

: I I I I I : I : I I I I : I I I I I : I I I II I : I I I I I I I I : I I I I I I : I I I I I I I I I I 
Db 216 IRQARLSDSGNYTCMAANIVAKRRSLSATVVVYWGGWSSWTEWSAC^r\^CGRGWQKRSR 275 

Qy 243 SCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAP 302 

: I I I I II I I I I I I I I I : I I I I I I I I I I I I 11:111 : I I I I I I : I I 
Db 276 TCTNPAPLNGGAFCEGMSVQKITCTALCPVDGSWEVWSEWSVCSPECEHLRIRECTAPPP 335 

Qy 303 RNGGEECQGTDLDTRNCTSDLCV HSASGPEDVALYVGLIAVAVC 346 

I I I I : I : I : : I I I I I : I : I I I I I II 

Db 336 RNGGKFCEGLSQESENCTDGLCILDKKPLHEIKPQRWSRRGIENASDIALYSGL-GAAW 394 

Qy 347 LVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL--TIQPDLSTT 404

I : I : : : : I I I I I I I I I I I I : I : I II : I I I I I 

Db 395 AVAVLVIGVTLYRRSHSDYGVDVIDSSALTGGFQTFNFKTVRQGNSLLLNPAMQPDL--TV 453 

Qy 405 TTTYQGSLCPRQDGPSPKFQLTNGHLLSPLG GGRH — 439

: I I I : I I I I I : I I : I I II 
Db 454 SRTYSGPIC-LQD-PLDKELMTESSLFNPLSDIKVTCVQSSFMVSLGVSERAEYHGKNHSG 511 

Qy 440 TLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIP 488

I : I : I :: II III I III I I I I I :: I 

Db 512 TFPHGNNRGFSTIHPRNKT PYIQNLS SLPTRTELRTTGVFGHLGGRLVMP 561 

Qy 489 NTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI 548 

111:11111 III :|||::::: | | I : I I I I I : I I I I : I : I I 

Db 562 NTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMLVTTPFA 620 

Qy 549 LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGR 608 

I : I I : I : I : : I I I : : : I I I : I : : : I : I I I I : I I : I : I 
Db 621 LTIPHCADVSSEHWNIHLKKRTQQGKWEEVMSVEDESTS — CYCLLDPFACHVLLDSFGT 678 

Qy 609 FALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLEKQLGGQLI 668

: I I I I : : I I : I I : : I : : I I I : I I : I I I I : : I I : I I : I : MM: 
Db 679 YALTGEPITDCAVKQLKVAVFGCMSCNSLDYNLRVYCVDNTPCAFQEVISDERHQGGQLL 738 

Qy 669 QEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERV 728 

: I I : : I I I I : : I : : I : I : I I I : I : I I : I I : I : : : I I I I : I I I 
Db 739 EEPKLLHFKGNTFSLQVSVLDIPPFLWRIKPFTACQEVPFSRVWSSNRQPLHCAFSLERY 798 

Qy 729 SPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPF 788 

: I : I : I : I I : : I : : I I : : I : I : : : II II Mill: 

Db 799 TPTTTQLSCKICIRQLKGHEQILQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPY 858 

Qy 789 LIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNL 848

I I I : I : : I I : I I I : ! I I I : : : I I : I I : : II : I : II II II I I I : I : I 
Db 859 SIRQRICATFDTPNAKGKDWQMLAQKNSINRNLS YFATQS S PSAVILNLWEARHQQDGDL 918 

Qy 849 SQLAAAVAGLGQPDAGLFTVSEAE 872

III: : I : I :: I : 



Db 919 DSLACALEEIGRTHTKLSNITEPQ 942 



RESULT 14 
Q7PW78 



ID Q7PW78 PRELIMINARY; PRT; 876 AA. 

AC Q7PW78; 

DT 01-MAR-2004 (TrEMBLrel. 26, Created) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last sequence update) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last annotation update) 

DE ENSANGP00000005212 (Fragment) . 

GN Name=ENSANGG00000004014;

OS Anopheles gambiae str. PEST. 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles. 

OX NCBI_TaxID=180454; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=PEST; 

RA Anopheles Genome Sequencing Consortium; 

RL Submitted (APR-2003) to the EMBL/GenBank/DDBJ databases.

CC -!- CAUTION: The sequence shown here is derived from an

CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is

CC preliminary data. 

DR EMBL; AAAB01008984; EAA14755.1; -.

DR GO; GO:0005515; F:protein binding; IEA.

DR GO; GO:0007165; P:signal transduction; IEA.

DR InterPro; IPR000488; Death.

DR InterPro; IPR011029; DEATH_like.

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

FT NON_TER 1 1 

FT NON_TER 876 876 

SQ SEQUENCE 876 AA; 96301 MW; 6F9336D53E096E00 CRC64; 



Query Match 22.5%; Score 1050.5; DB 2; Length 876; 

Best Local Similarity 30.6%; Pred. No. 4.1e-70; 

Matches 284; Conservative 138; Mismatches 368; Indels 137; Gaps 30; 



Qy 18 LPHFLVEPEDVYIVKNKPVLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDGSSGLPTM 77

II Mill: |::||:| I III I II 111=1 : I II ||: 

Db 2 LPVFLVEPKGAYVMKNRPAKLYCKASHALQISFKCSGS-TKPPPTEKEHHTDPHSGVQLQ 60 

Qy 78 EVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSL 137 

I : : | : | : : | | : I : I I : I I I I : I I : : I : : I I I : 

Db 61 EATATITRELVDEFFGKGPFKCECRAYSSRGHVKTQPVTIQVATIKKQISISPKIVRVAT 120 

Qy 138 EQGIVXPCRPPEGIPPAEVT2WLRNEDLVT)PSLDPNv^ 197 

II I I : I I I : I I : I : I : : I : : : : I I I I I I I 

Db 121 GGRAELNC-IANATPAAKWWLKNS — VPVHANPPFVLLTENALLIARVEIQDMANYTCV 177 



Qy 198 AKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSA-SCGRGWQKRSRSCTNPAPLNGGAFC 256 

I : I I : I I : I I I : I I I I : I hi I : I I I : I I : I I : I I I I 

Db 178 AENIAGKRVSDPVPITVYVDGGWSSWGPWTDCKCPGHGKQGQKRTRVCNSPVPMNNGAPC 237 

Qy 257 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPA-PRNGGE--ECQGTD 313 

: I : : I I I I I I I : II I I I I I III I : I II 

Db 238 KGASTESTPDCLPCSA-GRWSSWSEWSECGPDCTQIRQRSCVAQAFAIDSGTIVNCAGKS 296 

Qy 314 LDTRNCTSDLCVHSASGPE-DVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADS 372 

: II I I : : I I I : I I : I I I I : : I ■ : I : I : : : 

Db 297 QQSIKCTGGLCNYTAQDSNWSVYLWVTLVA-AFCLGVVFAV--SKFLRRKKTIPAYNLAR 353 

Qy 373 SILTSGFQPVSIK-PSKAD-NPHLLTIQP DLSTTTTT 407 

III I:: : I I I I I I . I I 

Db 354 SDLTQNTGPINYEYPMTASLQPHLAGHHPHHHHLQQQQHHHHLSSLHHQHGSSLLGPVAT 413 

Qy 408 YQGSL CPRQDGPSPKFQLTNGHLLSPLGGGRHTL HHSS 445 

II III: | : | | : : | | | 

Db 414 GHGQLHPQCQSQQAPT LPIGGLKSSLPLPRSNSEHHYDVPHLCNNTSPA 462 

Qy 446 --PTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPR 503 

I I : I I I : : I : I I I : : I I I I I I I : 

Db 463 IRPTTLVEE P F R G AE VT HAT L T P AG AL L R LAT Y S T AL L I P E G A I P K 508 

Qy 504 GKIYEIYLTLHKPE--DVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDS 561 

:::!:::: I : I I I I I * I III : :|::: : II I I 

Db 509 HQRHSVALSIVRDDKHHVPVPTGPRSTYLSPWFCGPVDTKWKPIVMQLPHCAENLSD- 567 

Qy 562 WSLRLKKQSCEGS-WEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 617 

I : I : I I : : I I I : I :: I I I I I :: I I I I : : 

Db 568 WAFSLYSAPDNWPWCKVVTIGEETLNTPALVQIDKRYAYVLTETFGKYVLVGESATDIQ 627 

Qy 618 -VAAAKRLKLLLFAPVACTSL-EYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLH 675 

I I I I : I : I : ::IM : I I : |:::|| h III 

Db 628 ERVACKRLRLFICGPSTVPEFSDVSLRVYIVEDNPGAEERCRHCEQEIGGVLLGRSTVLH 687 

Qy 676 FKD-SYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSD 734 

II I : : I I : : I I I I I I : II I I I : I I I 

Db 688 FADVGQAGLNI DLQCVGG — WRAKS S S ERQT I P FSHVWN S ACTALHC S FT L 736 

Qy 735 LACKLWVWQVEGDGQS FS INFN ITKDTRFA ELLALESEAGVPA 777 

I : II II : : I I I : I I I I 
Db 737 — CR TEHDKCDFKIWQASQDVPQGLDERLTAI GVPATLSISSVGSGDHNT 785 

Qy 778 -LVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILN 836 

II I : : : : : I : MM:: Mill I : : I : I : : II : : I I I I I I : 
Db 786 NLVATDRFRLSKDVKRKLCRCLDPPTQKRNDWRMLAAHLNVDRYLTYFATRPSPTDQILD 845 

Qy 837 LWEARHFPNGNLSQLAAAVAGLGQPDA 863 

MM: Ml : : II I 

Db 846 LWECRNRDLNALQQLIEICRTMERPDA 872 



RESULT 15 
UNC5_DROME 

ID UNC5_DROME STANDARD; PRT; 1072 AA. 



AC Q95TU8; Q9NBL0; Q9V7B5; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-JAN-2005 (Rel. 46, Last annotation update) 

DE Netrin receptor unc-5 precursor (Unc5 netrin receptor). 

GN Name=unc-5; ORFNames=CG8166; 

OS Drosophila melanogaster (Fruit fly). 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI_TaxID=7227; 

RN [1] 

RP SEQUENCE FROM N.A., FUNCTION, AND TISSUE SPECIFICITY. 

RX PubMed=11719202; 

RA Keleman K., Dickson B.J.; 

RT "Short- and long-range repulsion by the Drosophila Unc5 netrin 

RT receptor."; 

RL Neuron 32:605-617(2001). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RX MEDLINE=20196006; PubMed=10731132; DOI=10.1126/science.287.5461.2185; 

RA Adams M.D., Celniker S.E., Holt R.A., Evans C.A., Gocayne J.D., 

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A., Galle R.F., 

RA George R.A., Lewis S.E., Richards S., Ashburner M., Henderson S.N., 

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., 

RA Brandon R.C., Rogers Y.-H.C., Blazej R.G., Champe M., Pfeiffer B.D., 

RA Wan K.H., Doyle C., Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., 

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pfannkoch C., Baldwin D., 

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., 

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., 

RA Cherry J.M., Cawley S., Dahlke C., Davenport L.B., Davies P., 

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., 

RA Dodson K., Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., 

RA Durbin K.J., Evangelista C.C., Ferraz C., Ferriera S., Fleischmann W., 

RA Fosler C., Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., 

RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M., 

RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D., Houston K.A., Howland T.J., Wei M.-H., Ibegwam C., 

RA Jalali M., Kalush F., Karpen G.H., Ke Z., Kennison J.A., Ketchum K.A., 

RA Kimmel B.E., Kodira C.D., Kraft C., Kravitz S., Kulp D., Lai Z., 

RA Lasko P., Lei Y., Levitsky A.A., Li J.H., Li Z., Liang Y., Lin X., 

RA Liu X., Mattei B., McIntosh T.C., McLeod M.P., McPherson D., 

RA Merkulov G., Milshina N.V., Mobarry C., Morris J., Moshrefi A., 

RA Mount S.M., Moy M., Murphy B., Murphy L., Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A., Nixon K., Nusskern D.R., Pacleb J.M., 

RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V., Reese M.G., 

RA Reinert K., Remington K., Saunders R.D.C., Scheeler F., Shen H., 

RA Shue B.C., Siden-Kiamos I., Simpson M., Skupski M.P., Smith T., 

RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., 

RA Svirskas R., Tector C., Turner R., Venter E., Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., 

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., 

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M., Zhang G., Zhao Q., Zheng L., 

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; 

RT "The genome sequence of Drosophila melanogaster."; 

RL Science 287:2185-2195(2000). 

RN [3] 

RP GENOME REANNOTATION. 

RX MEDLINE=22426069; PubMed=12537572; 

RA Misra S., Crosby M.A., Mungall C.J., Matthews B.B., Campbell K.S., 

RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., 

RA Smith C.D., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., 

RA Bettencourt B.R., Celniker S.E., de Grey A.D.N.J., Drysdale R.A., 

RA Harris N.L., Richter J., Russo S., Schroeder A.J., Shu S.Q., 

RA Stapleton M., Yamada C., Ashburner M., Gelbart W.M., Rubin G.M., 

RA Lewis S.E.; 

RT "Annotation of the Drosophila melanogaster euchromatic genome: a 

RT systematic review."; 

RL Genome Biol. 3:RESEARCH0083.1-RESEARCH0083.22(2002). 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; TISSUE=Embryo; 

RX MEDLINE=22426066; PubMed=12537569; 

RA Stapleton M., Carlson J.W., Brokstein P., Yu C., Champe M., 

RA George R.A., Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., 

RA Rubin G.M., Celniker S.E.; 

RT "A Drosophila full-length cDNA resource."; 

RL Genome Biol. 3:RESEARCH0080.1-RESEARCH0080.8(2002). 

RN [5] 

RP FUNCTION. 

RX PubMed=12765609; 

RA Freeman M.R., Delrow J., Kim J., Johnson E., Doe C.Q.; 

RT "Unwrapping glial biology: Gcm target genes regulating glial 

RT development, diversification, and function."; 

RL Neuron 38:567-580(2003). 

CC -!- FUNCTION: Receptor for netrin required for motor axon guidance. 

CC Mediates both short- and long-range axon motor repulsion in the 

CC developing nervous system upon ligand binding. Also involved in 

CC glial migration. While short-range repulsion requires both fra and 

CC unc-5, long-range repulsion only requires unc-5. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Prior to gastrulation, it is strongly 

CC expressed in the presumptive mesoderm. Mesodermal expression 

CC begins to fade during stages 13-14, persisting only in the cells 

CC that form the dorsal vessel. Expressed within the CNS from late 

CC stage 13, shortly after the first axons have extended. Detected in 

CC several dispersed clusters of cells within the CNS, increasing in 

CC number as development proceeds. Also expressed in the peripheral 

CC and exit glia, which migrate laterally out of the CNS between 

CC stages 14 and 17. Strongly expressed in motor axons that exit the 

CC CNS ipsilaterally via the segmental nerve root (SN). Not expressed 

CC on either commissural or longitudinal axons within the CNS, nor on 

CC motor axons that exit via the intersegmental nerve (ISN). In the 

CC periphery, it is detected on all branches of the SN. Also 

CC expressed at high level in exit and peripheral glia along both the 

CC SN and ISN. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues (By 
CC similarity). 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 



CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 
CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 
CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 
CC 
CC This SWISS-PROT entry is copyright. It is produced through a collaboration 
CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 
CC the European Bioinformatics Institute. There are no restrictions on its 
CC use by non-profit institutions as long as its content is in no way 
CC modified and this statement is not removed. Usage by and for commercial 
CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
CC or send an email to license@isb-sib.ch). 
CC 
DR EMBL; AF247762; AAF74193.1; 
DR EMBL; AE003811; AAF58143.2; 
DR EMBL; AY058501; AAL13730.1; 
DR HSSP; P07996; 1LSL. 
DR FlyBase; FBgn0034013; unc-5. 
DR GO; GO:0007411; P:axon guidance; IGI. 
DR GO; GO:0008347; P:glia cell migration; IMP. 
DR InterPro; IPR000488; Death. 
DR InterPro; IPR011029; DEATH_like. 
DR InterPro; IPR007110; Ig-like. 
DR InterPro; IPR003598; Ig_c2. 
DR InterPro; IPR000884; TSP1. 
DR Pfam; PF00531; Death; 1. 
DR Pfam; PF00047; ig; 1. 
DR SMART; SM00005; DEATH; 1. 
DR SMART; SM00408; IGc2; 1. 
DR SMART; SM00209; TSP1; 2. 
DR PROSITE; PS50017; DEATH_DOMAIN; 1. 
DR PROSITE; PS50835; IG_LIKE; 1. 
DR PROSITE; PS50092; TSP1; 2. 
KW Developmental protein; Immunoglobulin domain; Phosphorylation; Receptor; Repeat; 
KW Signal; Transmembrane. 
FT SIGNAL 1 30 Potential. 
FT CHAIN 31 1072 Netrin receptor unc-5. 
FT DOMAIN 31 440 Extracellular (Potential). 
FT TRANSMEM 441 461 Potential. 
FT DOMAIN 462 1072 Cytoplasmic (Potential). 
FT DOMAIN 128 224 Ig-like. 
FT DOMAIN 232 314 Ig-like C2-type. 
FT DOMAIN 324 379 TSP type-1 1. 
FT DOMAIN 398 499 TSP type-1 2. 
FT DOMAIN 980 1067 Death. 
FT DISULFID 149 207 By similarity. 
FT DISULFID 253 303 By similarity. 
FT CARBOHYD 79 79 N-linked (GlcNAc...) (Potential). 
FT CARBOHYD 300 300 N-linked (GlcNAc...) (Potential). 
FT CONFLICT 885 885 P -> S (in Ref. 1). 
SQ SEQUENCE 1072 AA; 116416 MW; A0A6B5A96B10138F CRC64; 


Query Match 21.3%; Score 992; DB 1; Length 1072; 

Best Local Similarity 28.7%; Pred. No. 1.4e-65; 

Matches 282; Conservative 145; Mismatches 383; Indels 174; Gaps 33; 



Qy 11 PGANPDLLPHFLVTSPEDVTIVTOFKPVLLVCKAVP^ 70 

II : I I I I : I I I I : : I I I : I : I I I I : I : I I I : I : I I 

Db 120 PGEASNTLPIFLIEPESVFWKNRPAVLKCKASHSLQVI FKCSGS-SQPPPSTHETHVDP 178 



Qy 71 SSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 130 

: I : I I : I I : : I I : I ': I I I I I I I I I I : I I : I I : I I I 
Db 179 HTGVNMEEVTATIHRDLVDEFFGDGPFKCECHAWSSRGVVKSQAATVHIAYIRKSFNQSP 238 

Qy 131 LAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNEDLVDPSLDPNWITREHSLVVRQARLM 190 

: : I I I I I I I :: I :| :: :| : :: :|: I I I 

Db 239 TSLRLELGSRAELRCEPPGGFPEPKLTWHKNNAVITADSEPGITVS-AGTLIFRQVALQH 297 

Qy 191 TANYTCVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSA S CGRGWQKRS RS CTN 246 

I I I : I I : I I I I I II : I I I I I I I I I I I : I I III Nihil 

Db 298 MANYSCSAENIAGRRVSDSAVLIVYVNGGWSTWSPWRECKCAGKPSQGR KRSRTCNN 354 

Qy 247 PAP LNGGAFCEGQNVQKT ACAT LC P VD GSWSPWSKWSACGLDC 289 

I II I I I I I I : I I : I III I I I I I I I : I 

Db 355 PMPLNGGAQCPGPQIQKSADCAACPEDTQIVSPDGFDISSSKRMARWSAWSDWSICSAEC 414 

Qy 290 THWRSREC SDP 300 

I I : I II 
Db 415 IQVRRRKCLTQGQTQISSEAEEAGDLLLGAPGVGMAALIAAAGVGAVGSPSEATGSSSDI 474 

Qy 301 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV--ALYVGLIAV-AVCLVLLLLVLILV 357 

II I I I : I I : I I : I I I : I I : I I I : : 

Db 475 I PGYGKSLCAGKDIQTAECRGEQC Q I GKDD FDWT L YLGLAF I TAVC FAFGT ALI 528 

Qy 358 YCRKKEGLDSDV---ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCP 414 

I : I: :: |:: : : I : . I |: : I I 
Db 529 - CCARRGI RTN P H YNMARS VMDAD YMP - GWEKKEMRMHI EASNMGYDYV 576 

Qy 415 RQDGPSPKFQLTNGHLLS-PLGGGRHTLHH SSPTSEAEE 452 

I :: I I:: :| I I II III: 
Db 577 NPGHRY-LPGEHIMGMGIGCGGVTEHHYDVPNLSANYTNPIDHLSVDYLSETGESST 632 

Qy 453 FVSRLST--QNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIY 510 

: I I I I : : : I I I I : I : : I : I I I : : 

Db 633 ADTSNSTFDMNGKLSILNASKSSTYEMLGSAGGQLRLYGGELLLFVPEHAIGKHVKKHVS 692 

Qy 511 LTLHKPEDVRLPLAGCQT LLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRL 566 

II II: II I I : I I : I I I I : I I : I : I : : 

Db 693 LLLLSDECSRV SCATESSILCSSWHSAPRNYSFVKPVTLKIPHC-LVAPEQWHVHI 748 

Qy 567 KKQSCEG SWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 621 

I * : I : : I I I : : I I I I : : : I I I I I I : I I : 
Db 749 YHADSEHDELSVNWRRAVSVGEETINTPMFVQLEATHVFIMTEQLGHFTVVAEPRIQQPS 808 

Qy 622 KRLKLLLFAP-VACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSY 680 

: : I I I I : :: ::|:| : I :: :| :||| : I : I I : 

Db 809 IKMKLLAFSQHTPPSNANCSLRIYVVKDFPNSRDICA^A/EAKLGGSFLGESQVFAFTLNS 868 

Qy 681 HNLRLSIH--DVPSSLWKSKLLVSYQE-IPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 737 

II:: II:: I : I I : I I : Nihil : : I 
Db 869 RNLN I RVRS ADVEAA APYEHAI P YQHI LSNNS- 1 LHCEFSLRRQDQNS LC 917 

Qy 738 KLWVWQVEGDGQ S FS I N FN I TKDT RFAELLALES EAGVP A LVGPSA FKIPF 788 

: | | : | : | : : I I I I : : II : : I 

Db 918 VDFGQGSEDDYYTFNIPAHSMSGS--AEELASTTNTTISIDRQGNYVNESCVMDFVQLPH 975 



Qy 789 LIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEAR--HFPNG 846 

:: I :IMI III ll:||: I ::::||:| III I I I I I I I I 

Db 976 ATKRLICGALDPPRADERDWRLLAKKLNTDRYIAYFATKASPTEQILNLWECRANSSPGS 1035 

Qy 847 NLSQLAAAVAGLGQPDAGLFTVSE 870 

: : :: : I I I : I 

Db 1036 SSNSVSHTIMAL LLTLKE 1053 



Search completed: March 1, 2005, 09:03:33 
Job time : 162.7 secs 



